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TO WHOM IT MAY CONCERN: 

Be it known that I, Carl Alexander Kamb, a citizen of the United States, 
residing at 696 Donner Hill Circle, Salt Lake City, in the County of Salt Lake and 
State of Utah; I, Sanghee Yoo, a citizen of Korea, I, Miguel Garcia-Guzman, a 
citizen of Spain, and I, Michael Leslie Pierce, residing at 25 11 East 2860 South, 
Salt Lake City, in the county of Salt Lake and State of UT, have invented a new 
and useful "B-catenin pathway assays, and compositions therefrom," of which the 
following is a specification. 



1 B-catenin Assays, and Compositions Therefrom 

2 This application claims priority from and is a continuation-in-part of 08/812,994, 

3 now issued as U.S. Patent No. 5,955,275, and U.S. Application No. 60/253,325 

4 (VEN009/00/P12, filed November 27, 2000), the entire disclosures of which are 

5 specifically incorporated by reference herein in their entireties. 

6 Field of the Invention 

7 The present invention relates to certain nucleic acid sequences, amino acid 

8 sequences, other compositions and methods relating to the characterization and 

9 physiologic implications of ji-catenin/TCF pathways. 

10 Background of the Invention 

1 1 Colorectal cancer is the second leading cause of cancer-related deaths in the 

12 United States, being responsible for as many as 60,000 fatalities each year. Nearly five 

13 percent of the US population develops colorectal cancer, and this number is predicted to 



14 rise as the average life expectancy increases (Beart, R.W. (1991) American Cancer 

15 Society Textbook of Clinical Oncology. Atlanta, American Cancer Society, pg. 213-218). 

16 Colon cancer erupts from the lumenal surface of the colon and rectum. Normally, 

17 epithelial cells line the surface of the colon and invaginate into structures called crypts. 

18 Over the course of three to six days, stem cells located at the base of each crypt divide, 

19 and then differentiate as they migrate toward the apex where they die and are released 

20 into the lumen (see, for example, Lipkin, M. et al. (1963) "Cell proliferation kinetics in 

21 the gastrointestinal tract of man." J Clin Invest 42:767). The first manifestations of 

22 colorectal cancer are often observed clinically as a polyp; a mass of epithelial cells that 

23 protrude from the apex of the colonic crypts of the bowel wall (see, for example, Kent, 

24 T.H. et al. (1983) "Polyps of the colon and small bowel, polyp syndromes, and the polyp 

25 carcinoma sequence." in Norris HT (eds) Pathology of the Colon, Small Intestine, and 

26 Anus. New York, Churchill Livingstone, vol 2, pg 167). Polyps are, predominantly, 

27 divided into two classes. The nondysplastic form consists of a large mass of cells that 

28 have normal morphology. These aggregates line up in a single row along the basement 

29 membrane and exhibit a low frequency of becoming neoplastic. The second form of 

30 polyp is the adenomatous polyp. These formations are dysplastic in nature and exhibit an 

31 abnormal intracellular and intercellular organization. As tumor progression evolves, 



1 adenomatous polyps exhibit a high frequency of metastasis to surrounding tissues with 

2 the most common sites of invasion being the mesenteric lymph nodes, the peritoneal 

3 surface, and the liver. 

4 Genetic studies have shown that at least two forms of heritable colorectal cancers 

5 exist. The first group, which includes familial adenomatous polyposis coli (FAP), Peutz- 

6 Jegher syndrome, familial juvenile polyposis, Cronkhite-Canada syndrome and 

7 hyperplastic polyposis, is characterized by the appearance of multiple (hundreds to 

8 thousands) of benign, precursor, colorectal polyps. In addition to colorectal lesions, 

9 several of these afflictions are associated with manifestations in other tissues including 

10 soft tissue tumors, osteomas, dental abnormalities, congenital hypertrophy of the retinal 

1 1 pigment epithelium (CHRPE), and cancers of the thyroid, small intestine, stomach, and 

12 brain (see, for example, Giardiello, KM. (1995) "Gastrointestinal Polyposis syndromes 

13 and hereditary nonpolyposis colorectal cancer" in Rustgi AK (eds.) Gastrointestinal 

14 Cancers:Biology, Diagnosis, and Therapy. Philadelphia, Lippincott-Raven, pg. 361 -311 \ 

15 Hamilton, S.R. et al. (1995) "The molecular basis of Turcot' s syndrome." New England 

16 Medicine 332:839)). In contrast, patients with hereditary nonpolyposis colorectal 

17 cancer (HNPCC) lack an increase in the number of precursor adenomas yet share an 

18 increased risk for other ailments including cancer of the uterus, ovary and brain. 

19 Breakthroughs in molecular biology have identified several of the genes/gene families 

20 involved in the initiation and progression of colon cancer (see, for example, Vogelstein, 

21 B. et al. (1988) "Genetic alterations during colorectal-tumor development." New England 

22 /. Medicine 319:525; Kinzler, K. and Vogelstein, B. (1998) "Colorectal Tumors" in The 

23 Genetic Basis of Human Cancer. McGraw-Hill). In addition to identifying genes 

24 involved in DNA mismatch-repair (hMSHl , hMSH2, HPMS 1 , hPMS2), cell growth 

25 (e.g.the oncogenes K-ras, H-ras or N-ras), and cell cycle regulation (e.g tumor 

26 suppressors, p53), a growing body of evidence has suggested that a connection exists 

27 between cancer, cell adhesion and the Wnt/Wng pathway. Specifically attention has been 

28 drawn to a collection of gene products that include, but are not limited to, APC 

29 (adenomatous polyposis coli), 0-catenin, TCF/LEF, GCK3, and cadherin. 

30 The importance of APC in colorectal cancer was initially hypothesized when it 

3 1 was observed that roughly fifty percent of colorectal tumors exhibited cytologically 
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1 recognizable alterations of chromosome 5q, the position of the APC gene (see, for 

2 example, Kinzler, K.W. and Vogelstein, B. (1996) "Lessons from hereditary colorectal 

3 cancer." Cell 87:159-170). Since then, molecular analysis has shown that either APC or 

4 the gene encoding P-catenin are mutated at a high frequency in patients with FAP, 

5 sporadic colorectal tumors (see, for example, Ashton-Rickardt, P.G. (1989) "High 

6 frequency of APC loss in sporadic colorectal carcinoma due to breaks clustered in 5q21- 

7 22." Oncogene 4: 1 169; Sparks et al. (1998) " Mutational Analysis of the APC/p-catenin 

8 /TCF pathway in colorectal cancer." Cancer Res. 58: 1 130-1 134) and other malignancies 

9 and diseases including, but not limited to, melanoma, hepatocellular carcinoma, ovarian 

10 cancer, endometrial cancer, medulloblastoma pilomatricomas, prostate cancer and 

1 1 Alzheimers (see, for example, Morin, P. J. (1999) "P-catenin signaling and cancer." 

12 Bioessays 21:1021-1030, Barker, N. et al. (2000) "The Yin-Yang of TCF/p-catenin 

13 Signaling." Adv in Cancer Res. 77:1-2; De Ferrari, G.V. and Inestrosa N.C. (2000) "Wnt 

14 signaling function in Alzheimer's disease." Brain Res Brain Res Rev. 33(1): 1-12). 

15 Studies of the Wnt/Wng pathway have provided evidence for the role of APC in 

16 regulation of p-catenin. In the absence of a Wnt/Wng signal, a quaternary cytoplasmic 

17 complex comprising APC, p-catenin, glycogen synthase kinase-3p (GSK-3P) and 

18 conductin/axin is formed and mediates the phosphorylation and, consequently, targeted 

19 destruction of P-catenin via the ubiquitin proteasome pathway. Key to the formation of 

20 this complex is the association between APC and p-catenin; an interaction that is 

21 mediated by a 20 amino acid repeat found in the APC molecule (Figure 1, also, see, Su, 

22 L.K. et al. (1993) "Association of the APC tumor suppressor protein with catenins." 

23 Science 262(5 140): 1734-7). Natural mutations in this 20 a.a. sequence have been 

24 observed in many colon cancer lines and are associated with elevated levels of free p- 

25 catenin. In addition, it has been observed that reintroduction of wild type APC into these 

26 cells results in a dramatic down regulation of the free, cytoplasmic form of p-catenin thus 

27 suggesting that APC plays a critical role in the modulation of free, cytoplasmic p-catenin 

28 (Munemitsu, S. et al (1995) "Regulation of intracellular p-catenin levels by the 

29 adenomatous polyposis coli (APC) turmor-suppressor protein " PNAS 92:3046-3050). 
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1 APC's ability to down-regulate cytoplasmic p-catenin levels is dependent on its 

2 interaction with two negative regulators of the Wnt/Wng pathway: GSK-3p and 

3 conductin. Biochemical studies have shown that GSK-3P phosphorylates both p-catenin 

4 and APC, and that these events lead to an increase in APC's affinity to p-catenin and the 

5 eventual destruction of the molecule (Munemitsu, S. et al (1995)). Further evidence for 

6 the role of GSK-3p in p-catenin stability comes from the observation that loss of GSK-3p 

7 activity leads to increased levels of cytoplasmic P-catenin (He, X. et al. (1995) 

8 "Glycogen synthase kinase-3 and dorsoventral patterning in Xenopus embryos." Science 

9 374(6523):617-22). Conductin's interaction with p-catenin was initially identified by 

10 two-hybrid interaction assays. Subsequently, this molecule was shown to have additional 

1 1 binding sites for APC and GSK-3p and is now believed to act as a docking molecule or 

12 platform for the APC, p-catenin, GSK-3P complex. 

13 While the absence of a Wnt/Wng signal leads to the destruction of p-catenin, 

14 activation of the pathway counteracts the negative regulation of P-catenin. In response to 

15 the Wnt/Wng signal, the GSK 3p- binding protein (GBP) binds to GSK-3p and prevents 

16 phosphorylation of the relevant targets, APC and P-catenin. As a result of these events, 

17 cytoplasmic levels of free P-catenin are elevated, thus promoting the interaction between 

18 p-catenin and a family of transcription factors called TCFs (T-cell factors). 

19 The connection between cell adhesion and cancer has been speculated upon for several 

20 years. Tumor cells making the transition from benign lesions to malignant, invasive 

21 cancers, need to overcome cell-cell adhesion in order to invade surrounding tissues. 

22 Several studies have observed that the expression of cell adhesion molecules, including 

23 E-cadherin, are lost or altered during the development of many human cancers 

24 (Birchmeier W. and Behrens J. (1994) "Cadherin expression in carcinomas: role in the 

25 formation of cell junctions and the prevention of invasiveness. " Biochim Biophys Acta, 

26 1 198(1): 1 1-26) and that co-expression of a dominant negative form of E-cadherin can 

27 result in the development of adenomas in mice (see, for example, Michelle, L. et al. 

28 (1995) "Inflamatory Bowel Disease and Adenomas in Mice Expressing a Dominant 

29 Negative N-Cadherin." Science 270: 1203-1207). In many instances, forced reintroduction 

30 of wildtype adhesion molecules into invasive tumors results in the reversion to the 
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1 benign, non-invasive phenotype (Frixen ,U.H. et al. (1991) "E-cadherin-mediated cell- 

2 cell adhesion prevents invasiveness of human carcinoma cells." J Cell Biol 1 13(1): 173- 

3 85) again suggesting a strong correlation between cancer and cell adhesion. 

4 The bridge between APC/p-catenin and cell adhesion comes from both 

5 histological observations and molecular data. Studies in wild-type colorectal epithelial 

6 cells have shown that APC expression gradually increases as cells migrate to the top of 

7 the colonic crypt, and peaks just prior to when the cells undergo apoptosis and slough off 

8 into the lumen of the colon (see, Miyashiro, L et al (1995) "Subcellular localization of the 

9 APC protein: Immunoelectron microscopic study of the association of the APC protein 

10 with catenin." Oncogene 1 1:89). Furthermore, loss of just a single copy of the APC 

1 1 gene results in a decrease of the enterocyte crypt-to-villus migration (Mahmoud, N.N. et 

12 al. (1997) "Ape gene mutation is associated with a dominant-negative effect upon 

13 intestinal cell migration." Cancer Res 57(22):5045-50). Since loss of cells from the top 

14 of crypts is an important homeostatic process, it has been suggested that disruptions in 

15 cellular adhesion via mutations in APC and APC -interactive proteins may lead to 

16 alterations in normal cell growth regulation and/or apoptosis (see, for example, 

17 Wijnhoven B.P. (2000) "E-cadherin-catenin cell-cell adhesion complex and human 

18 cancer." Br J Surg. 87(8):992-1005; Kim, K. et al, (2000) "Overexpression of beta- 

19 Catenin Induces Apoptosis Independent of Its Transactivation Function with LEF-1 or 

20 the Involvement of Major GlCell Cycle Regulators." Mol Biol Cell 1 1(10):3509-3523; 

21 Weihl, C.C. (1999) "The role of beta-catenin stability in mutant PS 1 -associated 

22 apoptosis." Neuroreport 10(12):2527-32). Evidence supporting such a model come from 

23 the discovery that P-catenin associates with the cytoplasmic tail of cadherins (see 

24 Kemler, R. (1993) "Cytoplasmic protein interactions and regulation of cell adhesion." 

25 Trends in Genetics 9:317). Studies have shown that over-expression of C-cadherin alters 

26 the Wnt/Wng signaling cascade and that these alterations are mediated through the region 

27 of cadherin that interacts with P-catenin (see, for instance, Fagotto, F. et al. (1996) 

28 "Binding to Cadherins Antagonizes the Signaling Activity of p-catenin during Axis 

29 Formation in Xenopus." J of Cell Biology, 132: 1 105-1 1 14). Given that binding of P- 

30 catenin to APC and cadherins is mutually exclusive (both bind to a set of repeated 
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1 elements called "armadillo repeats") it is possible that APC/p-catenin fulfill some facet of 

2 their tumor suppressor function by modulating cell adhesion in this fashion. 

3 Despite detailed knowledge of these and other genes involved in colorectal 

4 cancer, the art to date has not provided an efficient method for exploring the biological 

5 intricacies of colon cancer and identifying new putative therapeutic drugs for the 

6 prevention and treatment of this disease. Prophylactic colectomies are still routinely 

7 performed on FAP patients as a preferred method to reduce the risk of cancer and patients 

8 with metastatis disease usually receive radiation and/or cuirent, broad-acting 

9 chemotherapeutic agents. Although such treatments can induce temporary remissions, 

10 they are often not curative, as evidenced by the fact that approximately 40% of the colon 

1 1 cancer patients die from the disease within 5 years. The present invention provides an 

12 opportunity to identify new drugs and drug targets that can be utilized to battle the 

13 increasing incidence of colon cancer that is predicted for the upcoming decade. 

14 Brief Summary of the Invention 

15 The present invention relates to activity of P-catenin-related pathways, as well as 

16 to compositions therefrom. More specifically, the present invention generally relates to 

17 methods for assessing p-catenin pathway-related activity, and from such methods, 

18 obtaining perturbagens with p-catenin-related activity. Such perturbagens then are used 

19 to obtain p-catenin-related targets, which in turn can be used to identify potential 

20 therapeutics. The invention also provides genetic material for the development of gene 

21 therapy agents, vectors and host cells. 

22 The present invention provides polypeptides of cadherin perturbagens V, VI and 

23 XI, biologically active fragments, analogs and modifications thereof, and polypeptides 

24 consisting essentially of such perturbagen sequences. In other aspects, the invention 

25 provides polypeptides having at least 99%, at least 95%, at least 90%, at least 85% or at 

26 least 80% sequence identitity or homology with such perturbagens, and in other aspects 

27 provides N- and C-terminal fragments of such perturbagens. The invention further 

28 provides a composition of such polypeptides in a pharmaceutically acceptable carrier, and 

29 for treating a p-catenin-related condition with a therapeutically effective amount of a 

30 polypeptide of the invention. 
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1 The present invention also provides polypeptides having p-catenin pathway 

2 activity that are fused to heterologous sequences, in some aspects a scaffold or more 

3 particularly, a fluorescent protein scaffold, and provides polypeptides having P-catenin 

4 pathway activity that are chemically modified, or more particularly, radiolabeled, 

5 acetylated, glycosylated, or fluorescently tagged. Antibodies to the polypeptides of the 

6 invention also are provided. 

7 The present invention further provides polynucleotides encoding cadherin 



8 perturbagens V, VI and XI, biologically active fragments, analogs and modifications 

9 thereof, and polypeptides consisting essentially of such perturbagen sequences. In other 

10 aspects, the invention provides polynucleotides encoding polypeptides having at least 

1 1 99%, at least 95%, at least 90%, at least 85% or at least 80% sequence identity or 

12 homology with such perturbagens, and in other aspects provides polynucleotides 

13 encoding N- and C-terminal fragments of such perturbagens. In some aspects, the 

14 polynucleotides are chemically synthesized. 



15 The present invention further provides host cells, vectors, and gene therapy 

16 vectors comprising the polynucleotides of the invention. The host cells of the invention 

17 further provide for methods for producing p-catenin-related polypeptides by culturing 

18 such host cells and recovering such polypeptides. 

19 The present invention also provides methods for identifying a cellular target that 

20 interacts with the polypeptides of the invention. In some aspects, the method is 

21 performed in vitro and comprises detecting reporter expression, and in particular aspects, 

22 utilizes a yeast two-hybrid assay format. The present invention further provides for the 

23 use of such target in screening for putative P-catenin-related therapeutics, and in some 

24 aspects screens for disruption of polypeptide-target pairs. In particular aspects, a 

25 combinatorial chemical library is so screened. 

26 Brief Description of the Drawings 

27 Figure Legends 

28 Figure 1. The Wnt/Wng pathway and proposed interactions with cadherin. The 

29 interaction of the Wnt/Wng molecule with its receptor leads to activation of GBP (GSK- 

30 3p binding protein) and inhibition of P-catenin degradation by the ubiquitin pathway. As 

31 a result, cytoplasmic p-catenin levels increase, leading to formation of the B-catenin-Tcf 
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1 complex and heightened transcription of genes carrying a Tcf binding element. Increased 

2 levels of free p-catenin also lend themselves to B-catenin-is rapidly degraded via the 

3 ubiquitin pathway. 

4 Figure 2. A. Mapping the functional region of a perturbagen. Four perturbagens are 

5 derived from different breakpoints within the same gene. By mapping the smallest 

6 sequence that is common to all four perturbagens (dotted line) it is possible to identify 

7 biologically critical regions (black box). B. Critical regions of a gene can be determined 

8 by deletion analysis. For instance, a series of N-terminal deletions (dotted line) can be 

9 tested for biological activity. In this example, full activity requires a molecule that is 

10 longer than deletion 2 but smaller than deletion 1. 

1 1 Figure 3. Isolation of a p-catenin/Tcf reporter line. A population of cells containing the 

12 TBE-GFP reporter construct and the dominant allele of p-catenin (P-cat S45Y) undergoes 

13 multiple rounds of FACS to enrich for bright (GFP + ) cells. Subsequently, cells are plated 

14 at low density to isolate individual clones. Samples of each clone are then transduced 

15 with the dominant negative allele of Tcf4 (Tcf4 A30). Cell lines that are responsive to the 

16 dominant effects of Tcf4 A30 are identified by FACS and the parent clone is recovered 

17 by returning to the original plate (i.e. stripped wells). 

18 Figure 4. A perturbagen library is introduced into the S4535 clone and screened by 

19 FACS to isolate dim clones. Genomic DNA prepared from the dim cells is then used to 

20 PCR amplify the perturbagen encoding inserts in each cell. Each insert is then subcloned 

21 into the original vector (pVT352. 1) and reinfected into a fresh population of S4535 cells 

22 for further enrichment. 

23 Figure 5. Basic two-hybrid methodology. When bait and prey molecules interact, the 

24 Gal4- AD and Gal40-BD binding domains of the Gal4 transcriptional activator are 

25 reconstituted. As a result, this functional unit can associate with the Gall UAS and 

26 induce transcription of the reporter gene (Leu2). 

27 Figure 6. Four-Hybrid System. Host cell RNA targets are identified through a four- 

28 hybrid modification of the original two-hybrid scheme. Expanded region (lower left) 

29 pictures interaction between "bait" and "target" RNA molecules. 

30 Figure 7. LANCE™. In the homogeneous assay, a Cy5 labeled perturbagen binds to an 

31 Eu-Target molecule in solution. A. When the two molecules are in close proximity, the 
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1 emissions of the lanthanide chelate can excite Cy5 and give rise to a robust signal. B. In 

2 the presence of a small molecule inhibitor, the Cy5-perturbagen-Target-Eu interaction is 

3 prevented. Subsequent excitation of Eu results in little or no signal. 

4 Figure 8. DELFIA™ . In the heterogeneous assay, the target is immobilized to a solid 

5 support using an Eu labeled monoclonal antibody. Following incubation with the Cy5 

6 labeled perturbagen, the well is washed to remove unbound Cy5. Due to the close 

7 proximity of the Eu and Cy5 moieties in the bound complex, excitation of the lanthanide 

8 chelate leads to excitation (and emission) of Cy5. In the presence of a small molecule 

9 inhibitor (black circles), the Eu-target and Cy5-perturbagen moieties never come in 

10 close proximity. In subsequent washes, the free, unbound, Cy5-peptide conjugate is 

1 1 removed and the Eu induced Cy5 signal is insignificant. 

12 Figure 9. Construction of the full length P-catenin clone. 

13 Figure 10. Construction of the full length Tcf clone. 

14 Figure 11. Histogram comparing the fluorescent distribution of i) HEK293 cells, ii) 

1 5 HEK293 cells with the TBE2 x 4 -GFP reporter, iii) HEK293 cells with the TBE2 x 4 - 

16 GFP reporter and p-cat S37F, iv) HEK293 cells with the TBE2 x 4 -GFP and p-cat S45. 

17 The y-axis represents cell number. The x-axis represents fluorescence intensity. 

18 Figure 12. Responsiveness of Clone S4535. A. Histogram of HEK 293 cells. 

19 B. Histogram of Clone S4535. C. Clone S4535 with Tcf 4 A30. 

20 Figure 13 -15. Cadherin Perturbagens. The DNA and peptide sequences of perturbagens 

2 1 are listed. In some cases a second DNA sequence indicates the reverse strand (labeled R) 

22 of the perturbagen insert. Penetrance for each clone is also displayed. Isoforms of 

23 cadherin are identified. 

24 Figure 16. Histogram of S4535 cells containing CadV. 

25 Figure 17. Bar Graph showing effects of TcfDN and Cad5CD on dead cell number in 

26 HT29 cells, HEK293 cells, HMEK cells, and HUVEC's. 

27 Figure 18. Bar Graph showing effects of TcfDN and Cad5CD on total cell number in 

28 HT29 cells, HEK293 cells, HMEC cells, and HUVEC's. 

29 Figure 19. Histogram of cells used in gene profiling studies. 

30 Figure 20. List of target genes and/or Est sequences that are altered in both Tcf4DN and 

3 1 Cad5CD transformed lines. The "ratio" refers to numbers derived from calculations made 
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1 by Incyte Genomics. "Z" values were determined using calculations described in Kamb 

2 et al. 

3 Figure 21. List of target genes and/or Est sequences that are altered in Cad5CD 

4 transformed lines. 

5 Figures 22-24. Diagrams of vectors used in these studies. 

6 Definitions 

7 The terms "perturbagen" or "phenotypic probe" refers to an agent that is 

8 proteinaceous or ribonucleic in nature and acts in a transdominant mode to interfere with 

9 specific biochemical processes in cells, i.e., through its interaction with specific cellular 

10 target(s) or other such component(s), capable of disrupting or activating a particular 

1 1 si gnaling pathway and/or cellular event. Perturbagens may be encoded by a naturally 

12 derived library of compounds such as a cDNA or genomic DNA (gDNA) expression 

13 library, or an artificial library comprising synthetic oligonucleotide sequences of a 

14 desired length or range of lengths, e.g. a random peptide library. Alternatively, the 

15 perturbagen itself can be synthesized using chemical methods. The term "proteinaceous 

16 perturbagen" encompasses peptides, oligo- or polypeptides, proteins, protein fragments, 

17 or protein variants. Some proteinaceous perturbagens can be as short as three amino 

18 acids in length. Alternatively, these agents can be greater than 3 amino acids but less 

19 than ten amino acids. Other agents can be greater than ten amino acids but shorter than 

20 30 amino acids in length. Still other agents can be greater than 30 amino acids but less 

21 than 100 amino acids in length. Still other agents can be greater than 100 amino acids in 

22 length. Naturally occurring proteinaceous perturbagens (i.e. those derived from cDNA or 

23 genomic DNA) exhibit a range in size from as little as three to several hundred amino 

24 acids. In contrast, synthetic perturbagens (such as those present in a synthetic peptide 

25 library) may range in size from three amino acids to fifty amino acids in length and more 

26 preferably, from three to 20 amino acids in length, and yet more preferably, about 15 

27 amino acids in length. Similarly, the length of RNA perturbagens can vary. Some RNA 

28 perturbagens are as short as 6-10 nucleotides in length. Other RNA perturbagens are 

29 between 10 and 50 nucleotides in length. Still other RNA perturbagens are between 50 

30 and 200 nucleotides in length. Other RNA perturbagens are greater than 200 nucleotides 

31 in length. 
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1 The term "mimetic" refers to a small molecule that (i) exerts the same or similar 

2 physiological or phenotypic effect in a bioassay system or in an animal model as does a 

3 given perturbagen, or (ii) is capable of displacing a perturbagen from a target in a 

4 displacement assay. 

5 The term "small molecule" refers to a chemical compound, for instance a peptide 

6 or oligonucleotide that may optionally be derivatized, natural product or any other low 

7 molecular weight (less than about 1 kDalton) organic, bioinorganic or inorganic 

8 compound, of either natural or synthetic origin. Such small molecules may be a 

9 therapeutically deliverable substance or may be further derivatized to facilitate delivery. 

10 The term "target" refers to any cellular component that is directly acted upon by 

1 1 the perturbagen that leads to and/or induces the phenotypic change, detectible for 

12 example in a bioassay system. 

13 The terms "library" or "genetic library" refer to a collection of nucleic acid 



14 fragments that may individually range in size from about a few base pairs to about a 

15 million base pairs, with typical expression libraries of about nine base pairs to about ten 

16 thousand base pairs. These fragments are generated using a variety of techniques familiar 

17 to the art. 



18 The term "sublibrary" refers to a portion of a genetic library that has been isolated 

19 by application of a specific screening or selection procedure. 

20 The term "insert" in the context of a library refers to an individual DNA fragment 

21 that constitutes a single member of the library. 

22 The terms "reporter gene" and "reporter" refer to nucleic acid sequences or 



23 encoded polypeptides for which screens or selections can be devised. Reporters may be 

24 proteins capable of emitting light, or genes that encode intracellular or cell surface 

25 proteins detectable by antibodies. Preferably, the reporter activity may be evaluated in a 

26 quantitative manner. Alternatively, reporter genes can confer antibiotic resistance or 

27 selectable growth advantages. 

28 The term "gene" refers to a DNA substantially encoding an endogenous cellular 

29 component, and includes both the coding and antisense strands, the 5' and 3' regions that 

30 are not transcribed but serve as transcriptional control domains, and transcribed but 
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1 untranslated domains such as introns (including splice junctions), polyadenylation 

2 signals, ribosomal recognition domains, and the like. 

3 The terms "polynucleotide" or "nucleic acid molecule" are used interchangeably 

4 to refer to polymeric forms of nucleotides of any length. The polynucleotides may 

5 contain deoxyribonucleotides, ribonucleotides and/or their analogs. Nucleotides may 

6 have any three-dimensional structure, and may perform any function, known or unknown. 

7 The term "polynucleotide" includes single-, double-stranded and triple helical molecules. 

8 "Oligonucleotide" refers to polynucleotides of between 5 and about 100 nucleotides of 

9 single- or double-stranded DNA. Oligonucleotides are also known as oligomers or oligos 

10 and may be isolated from genes, or chemically synthesized by methods known in the art. 

1 1 The following are non-limiting embodiments of polynucleotides: a gene or gene 

12 fragment, exons, introns, mRNA, tRNA, rRNA, ribozymes, cDNA, recombinant 

13 polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any 

14 sequence, isolated RNA of any sequence, nucleic acid probes and primers. A nucleic 

15 acid molecule may also comprise modified nucleic acid molecules, such as methylated 

16 nucleic acid molecules and nucleic acid molecule analogs. Analogs of purines and 

17 pyrimidines are known in the art, and include, but are not limited to, aziridinycytosine, 4- 

18 acetylcytosine, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 

19 5-carboxymethyl-aminomethyluracil, inosine, N6-isopentenyladenine, 1-methyladenine, 

20 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 

21 methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, pseudouracil, 5- 

22 pentylnyluracil and 2,6-diaminopurine. The use of uracil as a substitute for thymine in a 

23 deoxyribonucleic acid is also considered an analogous form of pyrimidine. 

24 The term "fragment" refers to any portion of a proteinaceous perturbagen that is 

25 at least 3 amino acids in length, or any RNA molecule that is at least 5 nucleotides in 

26 length. The descriptors "biologically relevant" or "biologically active" refer to that 

27 portion of a protein or protein fragment, RNA or RNA fragment, or DNA fragment that 

28 encodes either of the two previous entities, that is responsible for an observable 

29 phenotype, some portion of an observable phenotype, or for activation of a correlative 

30 reporter construct. 
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1 The term "variant" refers to biologically active forms of the perturbagen sequence 

2 (or the polynucleotide sequence that encodes the perturbagen) that differ from the 

3 sequence of the initial perturbagen. 

4 The terms "homology" or "homologous" refers to the percentage of residues in a 



5 candidate sequence that are identical with the residues in the reference sequence after 

6 aligning the two sequences and introducing gaps, if necessary, to achieve the maximum 

7 percent of overlap (see, for example, Altschul, S.F. et al. (1990) "Basic local alignment 

8 search tool." / Mol Biol 215(3):403-10; Altschul, S.F. et al. (1997) "Gapped BLAST and 

9 PSI-BLAST: a new generation of protein database search programs." Nucleic Acids Res 

10 25(17):3389-402). It is understood that homologous sequences can accommodate 

1 1 insertions, deletions and substitutions in the nucleotide sequence. Thus, linear sequences 

12 of nucleotides can be essentially identical even if some of the nucleotide residues do not 

13 precisely correspond or align. The reference sequence may be a subset of a larger 

14 sequence, such as a portion of a gene or flanking sequence, or a repetitive portion of a 

15 chromosome. 



16 The term "scaffold" refers to a proteinaceous or RNA sequence to which the 

17 perturbagen is covalently linked to provide e.g., conformational stability and/or 

1 8 protection from degradation . 

19 Detailed Description of the Preferred Embodiments 

20 Agents isolated from the methods described herein have broad potential and 



21 application. Among others, the invention permits the definition of disease pathways, the 

22 identification of diagnostically or therapeutically useful targets, and the identification of 

23 therapeutic agents. For example, P-catenin/TCF pathway-related genes that are mutated, 

24 up-regulated, or down-regulated under disease conditions may be involved in causing or 

25 exacerbating the disease condition. Treatments directed at modulating the activity of 

26 such genes or treatments that involve alternate pathways may ameliorate the disease 

27 condition. Also, the agents and assays described herein thus have utility as models for 

28 diseases related to the p-catenin/TCF pathway. The assays may be utilized as part of a 

29 screening strategy designed to identify agents such as compounds that are capable of 

30 ameliorating disease symptoms. 
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1 As more fully disclosed herein, the described methodology yields first a set of 

2 RNA-based or proteinaceous agents, and second, a set of endogenous cellular targets. 

3 Each RNA-based or proteinaceous agent (or a mimetic, agonist or antagonist thereof 

4 identified through, e.g., routine small molecule screens) may be useful as a direct 

5 therapeutic agent in the treatment of cancer and/or various diseases. With each such new 

6 agent, a corresponding target molecule can be readily identified using standard 

7 interaction methodologies such as the two-hybrid technique. Such targets are useful in 

8 the development of novel drugs for new chemotherapeutic strategies and may provide 

9 useful diagnostic tools for profiling the genetic background (genotype) of the particular 

10 disease under study. 

11 A. Overview of the Invention 

12 The invention describes the isolation of new and previously unidentified agents 

13 that alter the sensitivity of a cell to the activation of the (3catenin/TCF pathways. 

14 The perturbagens described herein were isolated using a phenotypic assay. See priority 

15 document U.S. Patent No. 5,955,275, "Methods for identifying nucleic acid sequences 

16 encoding agents that affect cellular phenotypes," the disclosure of which is incorporated 

17 by reference herein in its entirety. Briefly, the assay identifies agents that alter a cell's 

18 responsiveness to an activated form of p-catenin (e.g. p-catenin S45T, or p-catenin 

19 S37F). To accomplish this, a library of polynucleotide sequences is generated using a 

20 variety of techniques familiar to the art. After ligating this material into a standard 

21 expression vector, the library is transferred into a population of cells of a given type (e.g. 

22 a cell line) and screened for sequences that induce a particular biological phenotype. The 

23 assay advantageously identifies one or more relevant sequences from the library in the 

24 selected host cell population. Cells expressing a biologically relevant perturbagen induce 

25 a particular phenotype (or correlative activation of a reporter gene), and are then 

26 separated from the rest of the population using, e.g., high-throughput Fluorescent 

27 Activated Cell Sorting (FACS) screening procedures. Such high-throughput machines 

28 are both highly sensitive and efficient (obtaining screening speeds of approximately 

29 10,000 to approximately 65,000 cells or more per minute) thus facilitating identification 

30 of biologically relevant sequences that exist at low frequencies within a cell population. 
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1 Here, an assay has been designed to identify molecules that alter a cell's ability to 

2 respond to a mutated form of p-catenin (P-catenin Serine45Tyrosine, also referred to 

3 herein as p-catenin S45Y or p-cat S45Y). To accomplish this, a random primed library 

4 constructed from cDNA was transfected into a subline of the HEK 293 (human 

5 embryonic kidney) cell line that i) expressed the dominant, constitutively active form of 

6 P-catenin (P-catenin S45Y) and ii) contained a transcriptionally regulated reporter 

7 construct consisting of four tandem TCF-4 binding elements (TBE's) functionally linked 

8 to the coding region of EGFP. Subsequently, roughly 20 million of these cells, 

9 representing a 2x fold coverage of the library, were then subjected to FACS analysis to 

10 identify perturbagens that disrupted the P-catenin S45Y driven activation of the reporter 

11 construct. 

12 Perturbagen identification may elucidate the function of known genes, or 

13 alternatively may work in a "black-box" approach to identify new genes, gene products, 

14 or cellular targets. Thus in some instances, perturbagens may be encoded by a previously 

15 identified gene (or gene fragment thereof). Such a gene may be one whose contribution 

16 to the disease pathway has previously been identified. Alternatively, the contribution of a 

17 gene to the pathway may have been previously unrecognized. In yet other cases, the 

1 8 perturbagen may be found to have no homology with any previously identified 

19 polynucleotide or proteinaceous agent. Such perturbagens may be derived from 

20 previously unidentified genes, or alternatively may be random sequences that have the 

21 proper conformation and/or chemical characteristics needed to alter or modulate one or 

22 more components of a pathway(s) that influences the phenotype under investigation. In 

23 the methodology described herein, no prior knowledge of the perturbagen or of its 

24 corresponding gene, gene product or cellular target is necessary. Moreover, because it is 

25 possible for multiple perturbagens to assume similar secondary or tertiary conformations 

26 and/or have shared or related chemistries, two or more variants of the same perturbagen 

27 may be identified and isolated from a single library without any additional screening 

28 steps. Thus one need not spend laborious hours designing, redesigning, or manipulating 

29 any candidate molecules, and thus does not bias the experiment with preconceived 

30 conceptions of what will or will not induce the phenotype of interest. 
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1 B. Phenotypic Probes 

2 The invention encompasses both the phenotypic probes (perturbagens) described 

3 herewith and the polynucleotide sequences encoding them. As one of ordinary skill 

4 appreciates, such agents may be described by their RNA sequence, amino acid sequence, 

5 or correlative DNA sequence. Alternatively, the agents can be sufficiently described in 

6 terms of their identity as isolates of a library that exhibit a particular biological activity. 

7 Perturbagens may be encoded by a variety of genetic libraries, including those 

8 developed from cDNA, gDNA, and random, synthetic oligonucleotides synthesized using 

9 current available methods in chemistry (see, for example, Caponigro et al. (1998) 

10 'Transdominant genetic analysis of a growth control pathway." PNAS 95:7508-7513; 

1 1 Caruthers, M.H. et al. (1980) Nucleic Acids Symposium, Ser. 7:215-223; Horn, T. et al. 

12 (1980) Nucleic Acids Symposium, Ser. 7:225-232; Cwirla, S.E. et al. (1990) "Peptides on 

13 phage: a vast library of peptides for identifying ligands." Proc Natl Acad Sci 

14 87(16):6378-82). Alternatively, the perturbagen itself can be synthesized using chemical 

15 methods. For example, peptide and RNA synthesis can be performed using various 

16 techniques (Roberge, J.Y. et al. (1995) "A strategy for a convergent synthesis of N-linked 

17 glycopeptides on a solid support." Science 269:202-204; Zhang, X. et al. (1997) "RNA 

18 synthesis using a universal base-stable allyl linker." NAR 25(20): 3980-3983) and diverse 

19 combinatorial peptide libraries can be constructed using, a variety of strategies such as 

20 the multipin strategy, the tea bag method", or the split-couple-mix method (see, for 

21 instance, Geysen, H.M. et al (1984) "Use of peptide synthesis to probe viral antigens for 

22 epitopes to a resolution of a single amino acids." PNAS 81 :3998-4002; Houghten, R.A. 

23 (1985) "General methods for the rapid solid phase synthesis of large numbers of peptides: 

24 specificity of antigen-antibody interaction at the level of individual amino acids." /WAS 

25 82:5131-5135; Lam, K.S. et al. (1991) "A new type of synthetic library for identifying 

26 ligand binding activity." Nature 354:82-84; Al-Obeidi, F. et al. (1998) "Peptide and 

27 Peptidomimetic Libraries." Molecular Biotechnology: 9:205-223). Automated synthesis 

28 may be achieved using commercially available equipment such as the ABI 43 1 A peptide 

29 synthesizer (Perkin-Elmer). 

30 In some cases, the polynucleotide sequence encoding a perturbagen represents a 

31 fragment of an existing gene. Using currently available software, it is possible to identify 
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1 the full length cDNA by aligning the perturbagen encoding sequence with pre-existing 

2 sequences maintained in, for instance, publicly available genomic and/or EST data bases. 

3 In situations where the gene has not been identified, the perturbagen can be readily used 

4 to reverse engineer and identify the gene from which the phenotypic probe is derived. 

5 In the case where a perturbagen is encoded by only a portion of a particular gene, 

6 the nucleic acid sequence of such a perturbagen may be extended utilizing a partial 

7 nucleotide sequence and employing various PCR-based methods known in the art to 

8 detect upstream sequences. One such method, restriction site PCR, uses universal and 

9 nested primers to amplify unknown sequence from genomic DNA within a cloning vector 

10 (Sarkar, G. (1993) "Restriction-site PCR: a direct method of unknown sequence retrieval 

1 1 adjacent to a known locus by using universal primers." PCR Methods Applic. 2:3 18-322). 

12 Another method, inverse PCR, uses primers that extend in divergent directions to amplify 

13 unknown sequence from a circularized template. The template is derived from restriction 

14 fragments comprising a known genomic locus and surrounding sequences (see Triglia, T. 

15 et al. (1988) "A procedure for in vitro amplification of DNA segments that lie outside the 

16 boundaries of known sequences." NAR. 16:8186). A third method, capture PCR, 

17 involves PCR amplification of DNA fragments adjacent to known sequences in human 

18 and yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) "Capture PCR: 

19 efficient amplification of DNA fragments adjacent to a known sequence in human and 

20 YAC DNA." PCR Methods Applic. 1:111-119). In this method, multiple restriction 

21 enzyme digestions and ligations may be used to insert an engineered double stranded 

22 sequence into a region of known sequence before performing PCR. Other methods which 

23 may be used to retrieve unknown sequences are known in the art (Parker, J .D. et al 

24 (1991) "Targeted gene walking polymerase chain reaction." NAR. 19:3055-3060). In 

25 addition, one may use nested primers and PROMOTERFTNDER libraries (Clontech, Palo 

26 Alto, CA) to walk genomic DNA. This procedure avoids the need to screen libraries and 

27 is useful in finding intron/exon junctions. For all PCR based methods, primers may be 

28 designed, using commercially available software such as OLJGO 4.06 Primer Analysis 

29 software (National Biosciences, Plymouth MN) or another appropriate program, to be 

30 about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to 

31 anneal to the template at temperatures of about 68°C to 72°C. 



17 



1 In one particular embodiment, the invention encompasses proteinaceous 

2 perturbagens, biologically active fragments, (N-terminal, C-terminal, or internal) or 

3 variants thereof. Proteinaceous perturbagens can exert their effects by multiple means. 

4 For example, a peptide may act by binding and disrupting the interactions between two or 

5 more proteinaceous entities within the cell Alternatively, a peptide perturbagen can bind 

6 to, and disrupt translation of a particular mRNA molecule. As still another alternative, 

7 peptide perturbagens may bind to genomic DNA and disrupt gene expression by altering 

8 the ability of one or more transcription factor(s) (e.g. activators or repressors) from 

9 binding to a critical enhancer/promoter region of the regulatory region of the gene. 

10 Penetrance is another property of perturbagens. Penetrance is defined as the 

1 1 number of cells exhibiting a particular phenotype divided by the total number of cells in 

12 the experiment (when a perturbagen is present in the cells), minus the total number of 

13 cells exhibiting a particular phenotype divided by the total number of cells in the 

14 experiment when the perturbagen is not present in the cells. The penetrance of any given 

15 pertubagen can vary depending upon a variety of parameters including 1) the cell type it 

16 is being expressed in, 2) the vector being used to express the perturbagen, 3) the 

17 biological stability (half-life) of the perturbagen or mRNA encoding the perturbagen 4) 

1 8 the concentration of the perturbagen in the cell, as well as other parameters. Thus 

19 although penetrance is a factor that impacts how immediately a given perturbagen can be 

20 seen to exert an effect, in some instances, a desirable, biologically active perturbagen 

21 may present a relatively low rate of penetrance. As one of ordinary skill will appreciate, 

22 perturbagens of low penetrance may be obtained and manipulated via standard cycling 

23 and/or amplification procedures. Thus, some preferred perturbagens may exhibit as low 

24 as 1-2% penetrance. Other preferred perturbagens may exhibit between 2% and 5% 

25 penetrance, between 5 and 10% penetrance, 10% and 20% penetrance, between 20% and 

26 50% penetrance, or even in some instances, between 50% and 100% penetrance. 

27 In some instances, the action, penetrance, or biological activity of a perturbagen 

28 may be affected in some part by the scaffold to which it is associated. In some cases (for 

29 instance, in situations where the agent is shorter than 30 amino acids) the scaffold may 

30 drive the perturbagen to adopt a conformation that enhances its biological action. In still 

31 other instances, one or more neighboring residues from, e.g., the C-terminus of a 
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1 scaffold, may act in concert with the perturbagen to enhance the functionality of the 

2 molecule. In cases such as these, the complete biologically active sequence may include 

3 one or more C-terminal residues derived from the scaffold molecule. Multiple techniques 

4 may be used to determine the contribution of the scaffold to the phenotypic effect of any 

5 given perturbagen. Initially, perturbagen sequences can be shifted to alternative scaffolds 

6 and retested for biological activity. If these procedures result in a significant loss of the 

7 perturbagen' s activity, a fusion between the perturbagen and, for instance, the 30-most 

8 residues from the C-terminus of the scaffold may be linked to a second scaffold molecule 

9 and retested for biological activity. Should operations such as these lead to the recovery 

10 of lost activity, experiments in which smaller and small portions of the primary scaffold 

11 are associated with the perturbagen can be tested. 

1 2 In other embodiments, the phenotypic probe is an RNA molecule which is itself 

13 active (i.e. is not acting through the correlative encoded protein or peptide that results 

14 from translation of the RNA). There are multiple mechanisms by which RNA molecules 

15 may act to inhibit or activate a biological pathway. In some instances, the RNA 

16 perturbagen acts in an antisense mode to disrupt ribonucleic acid transcription or 

17 translation of a cellular mRNA target via hybridization to a target ribonucleic acid 

18 (Weiss, B. et al. (1999) "Antisense RNA gene therapy for studying and modulating 

19 biological processes." Cell Mol Life Sc.i 55(3):334-58). In this context the term 

20 "antisense" refers to any composition containing a nucleic acid sequence which is 

21 complementary to the "sense" strand of a particular target DNA (see, for example, 

22 Chadwick, D.R. et al. (2000) "Antisense RNA sequences targeting the 5' leader 

23 packaging signal region of human immunodeficiency virus type- 1 inhibits viral 

24 replication at post-transcriptional stages of the life cycle." Gene Therapy 7(16):1362-8). 

25 In other instances, RNA perturbagens may act as a RNA-PRO agents, disrupting p- 

26 catenin-TCF pathway by interacting with one or more proteinaceous components (e.g. 

27 APC) of the cell (see Sengupta, D.J. (1999) "Identification of RNAs that bind to a 

28 specific protein using the yeast three-hybrid system." RNA 5:596-601). In still other 

29 instances, RNA perturbagens may act as a triplex-forming oligonucleotide (TFO) agent to 

30 interact with promoter sequences, exons, introns, or other portions of genomic DNA to 

3 1 disrupt or activate transcription of components of the p-catenin-TCF pathway (see Postel, 
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1 RH. et al (1989) "Evidence that a triplex-forming oligonucleotide binds to the c-myc 

2 promoter in HeLa cells, thereby reducing c-myc RNA levels." PNAS 88: 8227-823 1 ; 

3 Svinarchuk, F. et al. (1997) "Recruitment of transcription factors to the target site by 

4 triplex-forming oligonucleotides." NAR 25:3459-3464). 

5 There does not appear to be a necessary correlation between size of a particular 

6 RNA (or proteinaceous) perturbagen and penetrance. Instead, the penetrance of 

7 perturbagens are dependent upon the perturbagen stability or half-life, the perturbagen' s 

8 ability to achieve access to the target molecule, and other factors. 

9 Perturbagens may also exhibit cross-reactivity. A variety of host target proteins can 

10 contain similarities in both the primary and secondary structure. As a result, one or more 

1 1 of the agents described herein may exhibit affinity for one or more target 

12 variants/isoforms present in nature. Similarly, agents identified in the following screens 

13 may exhibit affinity for two or more functionally unrelated proteins that contain regions 

14 or domains that share homology or related functional groups. Thus, for instance, a 

15 perturbagen that recognizes a zinc-binding domain of one protein, may also show affinity 

16 for the homologous (and functionally equivalent) region of a second protein (see, e.g., 

17 Mavromatis K. O. et al. (1997) "The carboxyl-terminal zinc-binding domain of the 

18 human papillomavirus E7 protein can be functionally replaced by the homologous 

19 sequences of the E6 protein." Viral Research 52(1): 109-18). In cases where such 

20 interactions lead to relevant biological phenotypes, the underlying mechanism(s) may 

2 1 differ considerably from those brought about by the original perturbagen-target 

22 interactions. Furthermore, in cases where an agent exhibits cross reactivity with 

23 secondary targets, said agents may be useful in a broader set of therapeutic and diagnostic 

24 applications than originally intended. 

25 Host range is another characteristic of perturbagens. The term "host range" refers 

26 to the breadth of potential host cells that exhibit perturbagen-induced phenotypes. In 

27 some instances, such as the case where the perturbagen is represented by an apoptosis- 

28 inducing fragment of BID, the host range is broad, due to the near ubiquitous 

29 participation of BID or BID-like agents in the apoptotic pathway of many cells. In 

30 contrast, some perturbagens have a very limited host range due to, for instance, the 

3 1 restricted expression of the perturbagen target. 
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1 C. Sequence Variants 

2 In another embodiment, the invention includes sequence variants of both the 

3 phenotypic probes and the polynucleotide sequences that encode them. Thus, in the case 

4 of proteinaceous perturbagens, variants contain at least one amino acid substitution, 

5 deletion, or insertion from the original isolated form of the perturbagen that provides 

6 biological properties that are substantially similar to those of the initial perturbagen. 

7 Similarly, variants of RNA-based phenotypic probes contain at least one nucleotide 

8 substitution, deletion, or insertion when compared to the original isolated sequence. 

9 In addition to being described by their respective sequence, variants may also be 

10 identified by the relative amounts of homology they have in common with the original 

1 1 perturbagen sequence. Alternatively, a variant of a proteinaceous perturbagen may be 

12 described in terms of the nature of an amino acid substitution. "Conservative" 

13 substitutions are those in which the substituting residue is structurally or functionally 

14 similar to the substituted residue. In non-conservative substitutions, the substituting and 

15 substituted residue will be from structurally or functionally different classes. For the 

16 purposes herein, these classes are as follows: 1. Electropositive: R, K,H; 2. 

17 Electronegative: D,E; 3. Aliphatic: V,L,I,M; 4. Aromatic: F,Y,W; 5. Small: A,S,T,G,P,C; 

18 6. Charged: R,K,D,E,H; 7. Polar: S,T,Q,N,Y,H,W; and Small Hydrophilic: C,S,T. 

19 Interclass substitutions generally are characterized as nonconservative, while intraclass 

20 substitutions are considered to be conservative. 

21 In some instances, variant polypeptides sequences can have 65-75% homology 

22 with the original agent. In other embodiments, variants have between 75% and 85% 

23 homology with the original agent. In still other embodiments, variants will have between 

24 85% and 95% homology with the original perturbagen agent. In yet other embodiments, 

25 variants have between 95% and greater than 99% polypeptide sequence identity with the 

26 original perturbagen agent. In some cases, the homology between two perturbagens 

27 (variants) is confined to a small region of the molecule (e.g. a motif). Such conserved 

28 sequences are often indicative of regions that contain biologically important functions 

29 and suggest the perturbagens share a common cellular target. In these situations, while 

30 only limited and conservative amino acid changes are desirable within the region of the 
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1 motif, greater levels of variation can exist in adjacent and more distal portions of the 

2 polypeptide. 

3 Like their proteinaceous counterparts, variants of RNA perturbagens may also be 

4 described in terms of percent homology. In some instances, the variant ribonucleotide 

5 sequences can have 65-75% homology with the original agent. In other embodiments, 

6 the variants have between 75% and 85% homology with the original agent or between 

7 85% and 95% homology with the original perturbagen sequence, or even between 95% 

8 and greater than 99% sequence identity with the original perturbagen agent. Again, 

9 greater variation can, in some embodiments, exist outside an identified region/motif 

10 without altering biological activity. 

1 1 Lastly, in reference to the DNA sequences encoding proteinaceous perturbagens, 

12 one who is skilled in the art will appreciate that the degree of variance will depend upon 

13 and/or reflect the degeneracy of the genetic code. As one in the art appreciates, a given 

14 protein sequence is equivalently encoded by a large number of polynucleotide sequences. 

15 Therefore, the invention encompasses each variation of polynucleotide sequence that 

16 encodes the given perturbagen, such variations being made in accordance with the 

17 standard triplet genetic code as applied to the polynucleotide sequence of each 

18 perturbagen. For each proteinaceous perturbagen described by amino acid sequence 

19 herein, all such corresponding DNA variations are to be considered as being specifically 

20 disclosed. 

21 Variants of phenotypic probes may arise by a variety of means. Some variants 

22 may be artifactual and result from, for instance, errors that occur in the process of PCR 

23 amplification or cloning of the perturbagen encoding sequence. Alternatively, variants 

24 may be constructed intentionally. For instance, it may be advantageous to produce 

25 nucleotide sequences encoding perturbagens possessing a substantially different codon 

26 usage. Codons may be selected to increase the rate at which expression of the peptide or 

27 RNA occurs in a particular prokaryotic or eukaryotic cell in accordance with the 

28 frequency with which particular codons are utilized by the host (Berg, O.G. (1997) 

29 "Growth rate-optimized tRNA abundance and codon usage." J Mol Biol 270(4):544-50). 

30 Additional reasons for substantially altering the nucleotide sequence encoding 

31 proteinaceous perturbagens (without altering the encoded amino acid sequences) include, 
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1 but are not limited to, producing RNA transcripts that have increased half -life. This may 

2 be accomplished by altering a sequence's structural stability (see, for example, Gross, G. 

3 et al. (1990) "RNA primary sequence or secondary structure in the translational initiation 

4 region controls expression of two variant interferon-beta genes in Escherichia coli." J 

5 Biol Chem. 265(29): 17627-36; Ralston, C. Y. et al. (2000) "Stability and cooperativity of 

6 individual tertiary contacts in RNA revealed through chemical denaturation " Nat Struct 
1 Biol. 7(5):371-4), or through addition of untranslated sequences that increase RNA 

8 stability/half-life through RNA-protein interactions (see, for example, Wang ,W. et al. 

9 (2000) "HuR regulates cyclin A and cyclin Bl mRNA stability during cell proliferation." 

10 EMBO J. 19(10):2340-50; Shetty, S. and Idell, S. (2000) "Posttranscriptional regulation 

11 of plasminogen activator inhibitor- 1 in human lung carcinoma cells in vitro." Am J 

12 Physiol Lung Cell Mol Physiol 278(l):L148-56). Also included the category of 

13 intentional variants are those whose sequence has been altered in order to add or deleted 

14 sites involved in post-translational modification. Included in this list are variants in 

15 which phosphorylation sites, acetylation sites, methylation sites, and/or glycosylation 

16 sites have been added or deleted (see, for example, Wicker-Planquart, C. (1999) "Site- 

17 directed removal of N-glycosylation sites in human gastric lipase." Eur J Biochem. 

1 8 262(3):644-5 1 ; Dou, Y. (1999) "Phos-phorylation of linker histone HI regulates gene 

19 expression in vivo by mimicking HI removal." Mol Cell 4(4):641-7). 

20 Variants may also arise as a result of simple and relatively routine techniques 

21 involving random mutagenesis or "DNA shuffling"; procedures that are often used to 

22 rapidly evolve perturbagen encoding sequences and allow identification of variants that 

23 have increased biological stability or activity (see, for instance, Ner, S.S. et al. (1988) "A 

24 simple and efficient procedure for generating random point mutations and for codon 

25 replacements using mixed oligonucleotides." DNA 7: 127-134; Stemmer ,W. (1994) 

26 "Rapid evolution of a protein in vitro by DNA shuffling." Nature 370:389-391). For 

27 instance, in mutagenic PCR, the fragment encoding the perturbagen is PCR amplified 

28 under conditions that increase the error rate of Taq polymerase. In general, this is 

29 accomplished by i) increasing the MgCl 2 concentrations to stabilize non-complementary 

30 pairings, ii) addition of MnCl 2 to diminish template specificity of the polymerase and iii) 

3 1 increasing the concentration of dCTP and dTTP to promote misincorporation of basepairs 
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1 in the reaction. As a result of this process, the error rate of Tag polymerase may be 

2 increased from 1.0 x 10~ 4 errors per nucleotide per pass of the polymerase, to 

3 approximately 7 x 10" 3 errors per nucleotide per pass. Amplifying a perturbagen- 

4 encoding sequence under these conditions allows the development of a library of 

5 dissimilar sequences which can subsequently be screened for variants that exhibit 

6 improved biological activity. 

7 In addition to variants that are created by artificial or accidental means, natural 

8 variants may also exist. For instance, in the course of screening any given genomic or 

9 cDNA library, it is possible that a perturbagen, derived from a sequence that exists in 

10 multiple copies within the genome (e.g. duplications, repetitive sequences), may be 

1 1 isolated numerous times. Such sequences often contain polymorphisms that result in 

12 alterations in the encoded RNA and polypeptide sequence (see, for example, Satoh, H. et 

13 al. (1999) "Molecular cloning and characterization of two sets of alpha-theta genes in the 

14 rat alpha-like globin gene cluster." Gene 230(l):91-9) and thus, may represent natural 

15 variants of the perturbagen agent. Alternatively, if multiple libraries are utilized to screen 

16 for perturbagens and two or more of those libraries are derived from unrelated 

17 individuals, dissimilar tissues, or contrary periods in the development of a tissue (e.g. 

18 adult vs. fetal tissue) it is possible that variants may be isolated as a result of allelic 

19 variation (see, for example, Posnett, D.N. (1990) "Allelic variations of human TCR V 

20 gene products." Immunol Today. ll(10):368-73). Variants of phenotypic probes may 

21 arise by these and other means. 

22 Variants of any given perturbagen may in some instances exhibit additional 

23 biological properties. For instance, perturbagens that previously recognized only a single 

24 target may demonstrate broadened specificity, e.g., may bind multiple isoforms or 

25 serotypes of a target in response to the alteration of a single amino acid in the perturbagen 

26 variant. Similarly, a perturbagen having a specific phenotype in one cell may exhibit 

27 additional phenotypes or may exhibit a broader effective host range after making small 

28 alterations in the perturbagen variant sequence. 

29 D. Biologically Active Fragments 

30 Some embodiments of the invention encompass "biologically active fragments" of 

31 a given proteinaceous or RNA-based perturbagen. Biologically active fragments may be 
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1 compromised of N-terminal, C-terminal, or internal fragments of peptide perturbagens, or 

2 5', 3 'or internal fragments of RNA perturbagens. In some instances, the fragment 

3 encodes or represents portions of a natural gene. In other instances the fragment is 

4 derived from a larger polynucleotide or polypeptide that has no known natural 

5 counterpart. In still other instances, biologically active regions of a perturbagen can be 

6 artificially synthesized (by chemical or recombinant methods) so that multiple, tandem 

7 copies of the phenotypic probe are covalently linked together and expressed. All such 

8 biologically active perturbagen fragments are, in turn, encoded by a variety of correlative 

9 DNA sequences. 

10 The biologically active portion of a molecule can be identified by several means. 

11 In some instances, biological relevant regions can be deduced by simple physical 

12 mapping of families of overlapping sequences isolated from a phenotypic assay 

13 (Hingorani, K. et al. (2000) "Mapping the functional domains of nucleolar protein B23." 

14 J Biol Chem May 26). For instance, in the course of any given screen, multiple 

15 perturbagens, derived from alternative breakpoints of the same gene, may be isolated 

16 from one or more genetic libraries. (Figure 2). The smallest region that is common to all 

17 of the perturbagens can demarcate the area of biological importance. 

18 Alternatively, critical regions of a perturbagen can frequently be distinguished by 

19 comparing the polynucleotide and/or amino acid sequence of two or more perturbagens 

20 that share a common target (see, for example, Grundy, W.N. (1998) "Homology 

21 detection via family pair-wise search." J Comput Biol 5(3):479-9; Gorodkin, J. et al. 

22 (1997) "Finding common sequence and structure motifs in a set of RNA sequences." 

23 Ismb 5:120-3). In this instance, conserved sequences (or motifs) that are identified by 

24 this form of analysis often provide important clues necessary to determine biologically 

25 important regions of a given molecule. Alternatively, methods that identify biologically 

26 relevant regions by altering or deleting regions of the perturbagen molecule can also be 

27 used. For instance, the gene encoding a particular perturbagen can be subjected to 

28 deletion analysis whereby portions of the gene are removed in a systematic fashion, thus 

29 allowing the remaining entity to be retested for its ability to evoke a biological response 

30 (see, for example, Huhn, J. et al. (2000) "Molecular analysis of CD26-mediated signal 

3 1 transduction in cells." Immunol Lett 72(2): 127-132; Davezac, N. et al. (2000) 



25 



1 "Regulation of CDC25B phosphatases subcellular localization." Oncogene 19(18):2179- 

2 85). 

3 Alternatively, biologically critical regions of a molecule can be identified by 

4 inducing mutations in the sequence encoding the polypeptide (see, for example, Ito, Y. et 

5 al. (1999) "Analysis of functional regions of YPM, a superantigen derived from gram- 

6 negative bacteria." Eur J Biochem; 263(2):326-37; Kim, S.W. et al. (2000) 

7 "Identification of functionally important amino acid residues within the C2-domain of 

8 human factor V using alanine-scanning mutagenesis." Biochemistry 39(8): 195 1-8.). 

9 Subsequent testing of the variants of said molecule for biological activity enables the 

10 investigator to identify regions of the perturbagen that are both critical and sensitive to 

1 1 manipulation. Furthermore, molecular probes such as monoclonal antibodies and 

12 epitope-specific peptides can be useful in the identification of biologically important 

13 regions of a perturbagen (see, for example, Midgley, C.A. et al. (2000) "An N-terminal 

14 pl4ARF peptide blocks Mdm2-dependent ubiquitination in vitro and can activate p53 in 

15 vivo." Oncogene 19(19):2312-23; Lu, D. et al (2000) "Identification of the residues in 

16 the extracellular region of KDR important for interaction with vascular endothelial 

17 growth factor and neutralizing anti-KDR antibodies." J Biol Chem 275(19): 14321-30). 

18 In this procedure, probes that bind and thus mask specific regions of a perturbagen can be 

19 tested for their ability to block the biological activity of the molecule. These techniques 

20 (as well as others) can be used to map the boundaries of any given biologically active 

21 residues. 

22 E. Heterologous Sequences 

23 In another embodiment, the invention encompasses all heterologous forms of the 

24 phenotypic probes and the polynucleotide sequences encoding them described herewith. 

25 In this context, "heterologous sequence(s)" include versions of the perturbagens that are 

26 i) scaffolded by other entities, ii) tagged with marker sequences that can be recognized by 

27 antibodies or specific peptides, iii) altered to transform post-translational patterns of 

28 modification or iv) altered chemically so as to cyclize the molecule for alternative 

29 pharmacodynamic/pharmacokinetic properties. 
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1 1. Scaffolds 

2 Peptide perturbagens can be fused to protein scaffolds at N-terminal, C-terminal, 

3 or internal sites. Similarly, RNA derived perturbagens can be fused to RNA sequences at 

4 5' , 3' or internal sites. The fusion of a perturbagen to a second entity can increase the 

5 relative effectiveness of the perturbagen by increasing the stability of either the 

6 messenger RNA (mRNA) or protein of said agent. In some instances, scaffolds may be a 

7 relatively inert protein, (i.e. having no enzymatic activity or fluorescent properties) such 

8 as hemagglutinin. Such proteins can be stably expressed in a wide variety of cell types 

9 without disrupting the normal physiological functions of the cell. In other instances, 

10 scaffolds may serve a dual function, e.g., increasing perturbagen stability while at the 

1 1 same time, serving as an indicator or gauge of the level of perturbagen expression. In this 

12 case, the scaffold may be an autofluorescent molecule such as a green fluorescent protein 

13 (Clontech) or embody an enzymatic activity capable of altering a substrate in such a way 

14 that it can be detected by eye or instrumentation (e.g. p galactosidase). For example, in 

15 the invention described herein, various molecular techniques that are common to the field 

16 are used to link the perturbagen library to, e.g., the C-terminus of a nonfluorescent variant 

17 of GFP. "dEGFP" (also referred to as "dead-GFP") is one such nonfluorescent variant 

18 brought about by conversion of Tyr -> Phe at codon 66 of EGFP (Clontech). By linking 

19 the perturbagen library to this molecule, each library member is fused to a separate 

20 dEGFP molecule. Such chimeric fusions can easily be detected by Western Blot analysis 

21 using antibodies directed against GFP and are useful in determination of intracellular 

22 expression levels of perturbagens. In addition, by modifying the perturbagen sequences 

23 or the scaffold to which they are attached with various localization signals, the 

24 perturbagen may be directed to a particular compartment within the host cell. For 

25 example, proteinaceous perturbagens can be directed to the nucleus of certain cell types 

26 by attachment of a nuclear localization sequence (NLS); a heterogeneous sequence made 

27 up of short stretches of basic amino acid residues recognized by importins alpha and/or 

28 beta. 

29 2. Antibody-Tagged Perturbagens 

30 Perturbagens can be constructed to contain a heterologous moiety (a "tag") that is 

31 recognized by a commercially available antibody. Such heterologous forms may 
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1 facilitate studies of subjects including, but not limited to, i) perturbagen subcellular 

2 localization, ii) intracellular concentration assessment and iii) target binding interactions. 

3 In addition, the tagging of a perturbagen may also facilitate purification of fusion proteins 

4 using commercially available matrices (see, for example, James, E.A. et al. "Production 

5 and characterization of biologically active human GM-CSF secreted by genetically 

6 modified plant cells." Protein Expr Purif. 19(1): 13 1-8; Kilic, F. and Rudnick, G. (2000) 

7 "Oligomerization of serotonin transporter and its functional consequences." Proc Natl 

8 Acad Sci USA, 97(7):3106-1 1). Such moieties include, but are not limited to 

9 glutathione-S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), 

10 calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin (HA). GST, 

11 MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on 

12 immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate 

13 resins, respectively. FLAG, c-myc and HA enable immunoaffinity purification of fusion 

14 proteins using commercially available monoclonal and polyclonal antibodies that 

15 specifically recognize these epitope tags. Such fusion proteins may also be engineered to 

16 contain a proteolytic cleavage site located between the perturbagen sequence and the 

17 heterologous protein/tag sequence, so that the perturbagen may be cleaved away from the 

18 heterologous moiety following purification. A variety of commercially produced kits 

19 may be used to facilitate expression and purification of fusion proteins. 

20 3, Chemically Modified Perturbagens 

21 In addition to the chimeric variants described above, chemical modification 

22 encompass a variety of modifications including, but not limited to, perturbagens that have 

23 been radiolabeled with 32 P or 35 S, acetylated, glycosylated, or labeled with fluorescent 

24 molecules such as FTTC or rhodamine. These modifications may be directly imposed on 

25 the perturbagen itself (see, for example, Shuvaev, V.V. et al. (1999) "Glycation of 

26 apolipoprotein E impairs its binding to heparin: identification of the major glycation 

27 site." Biochim Biophys Acta 1454(3):296-308; Dobransky, T. et al. (2000) "Expression, 

28 purification and characterization of recombinant human choline acetyltransf erase: 

29 phosphorylation of the enzyme regulates catalytic activity." Biochem J. 349(Pt 1): 141- 

30 151). Alternatively, changes may be made to the polynucleotide sequence encoding the 

31 perturbagen so as to alter the pattern of phosphorylation, acetylation, glycosylation, or 
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1 that lead to cyclization of peptides in order to alter membrane permeability and/ or 

2 pharmacodynamic- pharmacokinetic properties (see, for example, Borchardt, R.T. (1999) 

3 "Optimizing oral adsorption of peptides using prodrug strategies." J Control Release 

4 62(l-2):231-8.). 

5 F. Hybridization 

6 The invention also encompasses polynucleotide sequences that are capable of 

7 hybridizing to the claimed polynucleotide sequences encoding phenotypic probes and 

8 said variants of such entities described previously, under various conditions of 

9 stringency. Such reagents may be useful in i) therapeutics, ii) diagnostic assays, iii) 

10 immunocytology, iv) target identification, and v) purification. For example, if the 

1 1 sequence encoding a particular perturbagen is introduced into a subject for gene 

12 therapeutic purposes, it may be necessary to monitor the success of integration and the 

1 3 levels of expression of said agent by Southern and Northern Blot analysis respectively 

14 (Pu, P. et al. (2000) "Inhibitory effect of antisense epidermal growth factor receptor RNA 

15 on the proliferation of rat C6 glioma cells in vitro and in vivo." JNeurosurg. 92(1):132- 

16 9). In other instances, hybridization may be used as a tool to define or describe a 

17 perturbagen variant or fragment, and a hybridizing sequence thus may have direct 

1 8 relevance as a mimetic or other such therapeutic agent. 

19 The term "hybridization" refers to any process by which a strand of nucleic acid 

20 binds with a complementary or near-complementary strand through base pairing. There 

21 are several parameters that play a role in determining whether two polynucleotide 

22 molecules will hybridize including salt concentrations, temperature, and the presence or 

23 absence of organic solvents. For instance stringent salt concentrations will ordinarily be 

24 less than about 750mM NaCl and 75mM trisodium citrate, preferably less than about 

25 500mM NaCl and 50mM trisodiium citrate, and most preferably less than about 250mM 

26 NaCl and 25mM trisodium citrate. Low stringency hybridization can be obtained in the 

27 absence of organic solvent (e.g. formamide) while high stringency hybridization can be 

28 obtained in the presence of at least about 35% formamide, and most preferably at least 

29 about 50% formamide. Stringent temperature conditions will ordinarily include 

30 temperatures of at least about 30°C, more preferably of at least about 37°C, and most 

3 1 preferably of at least about 42°C. Varying additional parameters, such as hybridization 
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1 time, the concentration of detergent and the inclusion or exclusion of carrier DNA are 

2 well known to those skilled in the art. Various levels of stringency are accomplished by 

3 combining these various conditions as needed. In a preferred embodiment, hybridization 

4 will occur at 30°C in 750 mM NaCl, 75mM trisodium citrate, and 1% SDS. In a more 

5 preferred embodiment, hybridization will occur at 37°C in 500 mM NaCl, 50mM 

6 trisodium citrate, 1% SDS, 35% formamide and lOOug/ml denatured salmon sperm DNA 

7 (ssDNA). In a most preferred embodiment, hybridization will occur at 42°C in 250 mM 

8 NaCl, 25mM trisodium citrate, 1% SDS, 50% formamide and 200ug/ml denatured 

9 ssDNA. Useful variations on these conditions will be readily apparent to those skilled in 

10 the art. 

1 1 The washing steps that follow hybridization can also vary greatly in stringency. 

12 Wash stringency conditions can be defined by salt concentration and by temperature. As 

13 above, wash stringency can be increased by decreasing salt concentration or by 

14 increasing temperature. For example, stringent salt concentrations for the wash steps will 

15 preferably be less than about 30mM NaCl and 3mM trisodium citrate, and most 

16 preferably less than about 15mM NaCl and 1.5mM trisodium citrate. Stringent 

17 temperature conditions for the wash steps will ordinarily include temperatures of at least 

18 about 25°C, more preferably of at least about 42°C, and most preferably of at least about 

19 68°C. In a preferred embodiment, wash steps will occur at 25°C in 30mM NaCl, 3mM 

20 trisodium citrate and 0.1% SDS. In a more preferred embodiment, wash steps will occur 

21 at 42°C in 15mM NaCl, 1.5mM trisodium citrate and 0.1 % SDS. In a most preferred 

22 embodiment, wash steps will occur at 68°C in 15mM NaCl, 1.5mM trisodium citrate and 

23 0.1% SDS. Additional variations on these conditions will be readily apparent to those 

24 skilled in the art. 

25 G. Expression Vectors 

26 The DNA sequence encoding each perturbagen or target (or variant or fragment 

27 thereof) may be inserted into an expression vector which contains the necessary elements 

28 for transcriptional/translational control in a selected host cell. Thus the DNA sequence 

29 may be expressed for, e.g., testing in a bioassay such as those described herein, or in a 

30 binding assay such as those described herein, or for production and recovery of the 

31 proteinaceous agent. Methods which are well known to those skilled in the art are used 
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1 to construct expression vectors containing sequences encoding the perturbagens and the 

2 appropriate transcriptional and translational control elements. These methods include in 

3 vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic 

4 recombination (see Sambrook, J. et al. (1989) "Molecular Cloning, A Laboratory 

5 Manual", Cold Spring Harbor Press, Plainview NY), 

6 Exemplary expression vectors may include one or more of the following: (i) 

7 regulatory sequences, such as enhancers, constitutive and inducible promoters, and/or (ii) 

8 5' and 3' untranslated regions, and/or (iii) mRNA stabilizing sequences or scaffolds, for 

9 optimal expression of the perturbagen in a given host. For instance, intracellular 

10 perturbagen levels can be modulated using alternative promoter sequences such as CMV, 

1 1 RS V, and S V40 promoters, to drive transcription (see, for example, Zarrin, A. A. et al. 

12 (1999) "Comparison of CMV, RSV, SV40 viral and Vlambdal cellular promoters in B 

13 and T lymphoid and non-lymphoid cell lines/' Biochim Biophys Acta. 1446(1-2): 135-9). 

14 Alternatively, inducible promoter systems, (e.g. ponesterone-induced promoter (PIND, 

15 Invitrogen, see Dunlop, J. et al. (1999) "Steroid hormone-inducible expression of the 

16 GLT-1 subtype of high-affinity 1-glutamate transporter in human embryonic kidney 

17 cells." Biochem Biophys Res Commun. 265(1): 101-5), tissue specific enhancers (see 

18 Scharf , D. et al (1994) Results Probl. Cell Differ. 20: 125-162), or scaffolding molecules 

19 (see, for example, see Abedi, M. et al. (1998), "Green fluorescent protein as a scaffold for 

20 intracellular presentation of peptides." Nucleic Acid Research 26(2):623-630) can be used 

21 to modulate intracellular perturbagen levels. 

22 A variety of paired expression vector/host systems may be utilized to contain and 

23 express sequences encoding the perturbagens. As one of ordinary skill will appreciate, 

24 the selection of a given system is dictated by the purpose of expression: e.g., bioassay, 

25 binding assay, or production of proteinaceous product for subsequent isolation and 

26 purification. Such systems include, but are not limited to, microorganisms such as 

27 bacteria transformed with recombinant bacteriophage, plasmid or cosmid DNA 

28 expression vectors; yeast transformed with yeast expression vectors, insect cell systems 

29 infected with viral expression vectors (e.g. baculovirus), plant cell systems transformed 

30 with viral expression vectors (e.g. tobacco mosaic virus, TMV) or with bacterial 

31 expression vectors (e.g. Ti or pBR322 plasmids; or mammalian cell systems {e.g. COS, 
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1 CHO, BHK, 293, 3T3) harboring recombinant expression constructs containing 

2 promoters derived from the genome of mammalian cells {e.g., metallothionine promoter) 

3 or from mammalian viruses {e.g., the adenovirus late promoter; the vaccinia virus 7.5 K 

4 promoter). The host cell employed does not limit the invention. 

5 In bacterial systems, a number of cloning and expression vectors may be selected 

6 depending upon the use intended for polynucleotide sequences encoding the 

7 perturbagens. For example, routine cloning, subcloning, and propagation of 

8 polynucleotide sequences encoding perturbagens can be achieved using a multifunctional 

9 E. coli vector such as PBLUESCREPT (Stratagene, La Jolla Ca). Ligation of sequences 

10 encoding perturbagens into the vector's cloning site disrupts the lacZ gene, allowing a 

1 1 colorimetric screening procedure for identification of transformed bacteria containing 

12 recombinant molecules. In addition, these vectors may be useful for in vitro 

13 transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 

14 nested deletions in the cloned sequence, (see e.g., Van Heeke, G. and Schuster, S.M. 

15 (1989) "Expression of human asparagine synthetase in Escherichia coli." /. Biol Chem. 

16 264:5503-5509). When large quantities of perturbagens are needed, e.g. for the 

17 production of antibodies, vectors which direct high level expression of perturbagens may 

18 be used. Exemplary vectors feature the strong, inducible T5 or T7 bacteriophage 

19 promoter; the E. coli expression vector pUR278 (Ruther et a/., EMBO J., 2:1791-94 

20 (1983)), in which the gene protein coding sequence may be ligated individually into the 

21 vector in frame with the lac Z coding region so that a fusion protein is produced; pESf 

22 vectors (Inouye & Inouye, Nucleic Acids Res., 13:3101-09 (1985); Van Heeke et al.,J. 

23 Biol. Chem., 264:5503-9 (1989)); and the like. pGEX vectors may also be used to express 

24 foreign polypeptides as fusion proteins with glutathione S-transferase (GST), In general, 

25 such fusion proteins are soluble and can easily be purified from lysed cells by adsorption 

26 to glutathione-agarose beads followed by elution in the presence of free glutathione. The 

27 pGEX vectors are designed to include thrombin or factor Xa protease cleavage sites so 

28 that the cloned anaphylatoxin C3a receptor gene protein can be released from the GST 

29 moiety. 

30 Yeast expression systems may also be used for production of perturbagens. A 

31 number of vectors containing constitutive or inducible promoters such as alpha factor, 
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1 alcohol oxidase, and PGH promoters, may be used in the yeast Saccharomyces cerivisiae 

2 or related strains. In addition, such vectors can be designed to direct either the secretion 

3 or intracellular retention of expressed proteins and enable integration of foreign 

4 sequences in the host genome for stable propagation, (see, e.g. Bitter, G.A. et al. (1987) 

5 "Expression and secretion vectors for yeast." Methods Enzymology. 153:516-544; and 

6 Scorer, C.A. et al. (1994) "Rapid selection using G418 of high copy number 

7 transformants of Pichia pastoris for high-level foreign gene expression." Biotechnology 

8 12:181-184). 

9 In mammalian host cells, a number of viral-based expression systems may be 

10 utilized. In cases where an adenovirus is used as an expression vector, the gene coding 

1 1 sequence of interest may be ligated to an adenovirus transcription/translation control 

12 complex, e.g., the late promoter and tripartite leader sequence. This chimeric gene may 

13 then be inserted in the adenovirus genome by in vitro or in vivo recombination. Insertion 

14 in a non-essential region of the viral genome (e.g., region El or E3) will result in a 

15 recombinant virus that is viable and capable of expressing gene protein in infected hosts. 

16 (e.g., see Logan et al, Proc. Natl Acad. Set USA, 81:3655-59 (1984)). Specific 

17 initiation signals may be used to achieve more efficient translation of sequences encoding 

18 the perturbagen. Such signals include the ATG initiation codon and adjacent sequences, 

19 e.g. the Kozak sequence. In cases where sequences encoding the perturbagen and its 

20 initiation codon and upstream regulatory sequences are inserted into the appropriate 

21 expression vector, no additional transcriptional or translational control signals may be 

22 needed. However, in cases where only coding sequence is inserted, exogenous 

23 translational control signals including an in-frame ATG initiation codon are provided by 

24 the vector. Furthermore, the initiation codon must be in phase with the reading frame of 

25 the desired coding sequence to ensure translation of the entire insert. Such exogenous 

26 translational elements and initiation codons may be of various origins, both natural and 

27 synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate 

28 transcription enhancer elements, transcription terminators, etc. (see Bitter, et al, Methods 

29 in Enzymol t 153:516-44 (1987)). Alternatively, many of these elements are not required 

30 in vectors that are specific for RNA-based perturbagens. Instead, sequences that stabilize 

3 1 the RNA transcript or direct the RNA sequence to a particular compartment will be 
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1 included (see, for instance, Wood Chuck post transcriptional regulatory element, WPRE, 

2 Zufferey, R. et al. (1999) "Woodchuck hepatitis virus posttranscriptional regulatory 

3 element enhances expression of transgenes delivered by retroviral vectors." / Virol 

4 73(4):2886-92). 

5 Plant systems may also be used for expression of perturbagens. Transcription of 

6 sequences encoding perturbagens may be driven by viral promoters, e.g. the 35S and 19S 

7 promoters of CaMV used alone or in combination with the omega leader sequence from 

8 TMV (Takamatsu, N. (1991) "Deletion analysis of the 5' untranslated leader sequence of 

9 tobacco mosaic virus RNA." J Virology 65: 1619-22). Alternatively, plant promoters 

10 such as the small subunit of RUBISCO or heat shock promoters may be used, (see, for 

1 1 example, Coruzzi, G. et al. (1984) 'Tissue-specific and light-regulated expression of a 

12 pea nuclear gene encoding the small subunit of ribulose-l,5-bisphosphate." EMBO J. 

13 3:1671-80; Broglie, R. et al. (1984) "Light-regulated expression of a pea ribulose-1,5- 

14 bisphosphate carboxylase small subunit gene in transformed plant cells." Science 24:838- 

15 843). 

16 In an insect system, Autographa californica nuclear polyhedrosis virus (AcNPV) 

17 is used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda 

1 8 cells. The gene coding sequence may be cloned individually into non-essential regions 

19 (for example the polyhedrin gene) of the virus and placed under control of an AcNPV 

20 promoter (for example the polyhedrin promoter). Successful insertion of gene coding 

21 sequence will result in inactivation of the polyhedrin gene and production of non- 
22 occluded recombinant virus (/. e. , virus lacking the proteinaceous coat coded for by the 

23 polyhedrin gene). These recombinant viruses are then used to infect Spodoptera 

24 frugiperda cells in which the inserted gene is expressed {see, e.g., Smith, et al, J. Virol. 

25 46: 584-93 (1983); U.S. Patent No. 4,745,051). 

26 In addition, a host cell strain may be chosen that modulates the expression of the 

27 inserted sequences, or modifies and processes the gene product in the specific fashion 

28 desired. Such modifications {e.g., glycosylation) and processing (e.g., cleavage) of 

29 protein products may be important for the function of the protein. Different host cells 

30 have characteristic and specific mechanisms for the post-translational processing and 

3 1 modification of proteins. Appropriate cell lines or host systems can be chosen to ensure 
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1 the correct modification and processing of the foreign protein expressed. To this end, 

2 eukaryotic host cells that possess the cellular machinery for proper processing of the 

3 primary transcript, glycosylation, and phosphorylation of the gene product may be used. 

4 Such mammalian host cells include but are not limited to CHO, VERO, BHK, HeLa, 

5 COS, MDCK, 293, 3T3, WI38, etc. 

6 The selected construct can be introduced into the selected host cell by direct DNA 

7 transformation or pathogen-mediated transfection. The terms "transformation" and 

8 "transfection" are intended to refer to a variety of art-recognized techniques for 

9 introducing foreign nucleic acid into a host cell, including calcium phosphate or calcium 

10 chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 

1 1 electroporation. Preferred technologies for introducing perturbagens into mammalian 

12 cells include, but are not limited to, retroviral infection as well as transformation by EBV 

13 or similar episomally-maintained viral vectors (Makrides, S.C. (1999) "Components of 

14 vectors for gene transfer and expression in mammalian cells/' Protein Expr Purif 

15 17(2): 183-202). Other suitable methods for transforming or transfecting host cells can be 

16 found in Maniatis, T. et al ("Molecular Cloning: A Laboratory Manual." Cold Spring 

17 Harbor Laboratory Press) and other standard laboratory manuals. 

1 8 For long term production of recombinant proteins in mammalian systems, stable 

19 expression of perturbagens in cell lines is preferred. For example, sequences encoding 

20 perturbagens can be transformed or introduced into cell lines using expression vectors 

21 which may contain viral origins of replication and/or endogenous expression elements 

22 and a selectable marker gene on the same or on a separate vector. Alternatively, cells can 

23 be transfected using, for instance, retroviral, adenoviral, or adeno-associated viral agents 

24 as delivery systems for the perturbagen. For example, retroviral vectors (e.g. LRCX, 

25 Clontech) may be used to introduce and express perturbagens in a variety of mammalian 

26 cell cultures. Such vectors may rely on the virus' own 5' LTR as a means of driving 

27 perturbagen expression or may utilize alternative promoters/enhancers (e.g. those of 

28 CMV, RSV and SV40, PIND) to regulate perturbagen expression levels. 

29 In a preferred embodiment, timing and/or quantity of expression of the 

30 recombinant protein can be controlled using an inducible expression construct. Inducible 

3 1 constructs and systems for inducible expression of recombinant proteins will be well 
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1 known to those skilled in the art. Examples of such inducible promoters or other gene 

2 regulatory elements include, but are not limited to, tetracycline, metallothionine, 

3 ecdysone, and other steroid-responsive promoters, rapamycin responsive promoters, and 

4 the like (No, et al, Proa Natl Acad Sci USA, 93:3346-51 (1996); Furth, et al, Proa 

5 Natl Acad Set USA, 91:9302-6 (1994)). Additional control elements that can be used 

6 include promoters requiring specific transcription factors such as viral, particularly HIV, 

7 promoters. In one in embodiment, a Tet inducible gene expression system is utilized. 

8 (Gossen et al, Proc. Natl Acad. Sci. USA, 89:5547-51 (1992); Gossen, et al, Science, 

9 268:1766-69 (1995)). Tet Expression Systems are based on two regulatory elements 

10 derived from the tetracycline-resistance operon of the E. coli TnlO transposon — the 

1 1 tetracycline repressor protein (TetR) and the tetracycline operator sequence (tetO) to 

12 which TetR binds. Using such a system, expression of the recombinant protein is placed 

13 under the control of the tetO operator sequence and transfected or transformed into a host 

14 cell. In the presence of TetR, which is co-transfected into the host cell, expression of the 

15 recombinant protein is repressed due to binding of the TetR protein to the tetO regulatory 

16 element. High-level, regulated gene expression can then be induced in response to 

17 varying concentrations of tetracycline (Tc) or Tc derivatives such as doxycycline (Dox), 

18 which compete with tetO elements for binding to TetR. Constructs and materials for tet 

19 inducible gene expression are available commercially from CLONTECH Laboratories, 

20 Inc., Palo Alto, CA. 

21 When used as a component in an assay system, the gene protein may be labeled, 

22 either directly or indirectly, to facilitate detection of a complex formed between the gene 

23 protein and a test substance. Any of a variety of suitable labeling systems may be used 

i 

24 including but not limited to radioisotopes such as I; enzyme labeling systems that 

25 generate a detectable calorimetric signal or light when exposed to substrate; and 

26 fluorescent labels. Where recombinant DNA technology is used to produce the gene 

27 protein for such assay systems, it may be advantageous to engineer fusion proteins that 

28 can facilitate labeling, immobilization and/or detection. 

29 Indirect labeling involves the use of a protein, such as a labeled antibody, which 

30 specifically binds to the gene product. Such antibodies include but are not limited to 
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1 polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments produced 

2 by a Fab expression library. 

3 In some instances, a preliminary selection is performed to verify that the host cells 

4 have been successfully transformed/transfected. Following the introduction of the vector, 

5 cells are allowed to grow in enriched media, and are then switched to selective media. 

6 The selectable marker confers resistance to the selective agent, and thus, only those cells 

7 that successfully express the introduced sequences survive in the selective media. Any 

8 number of selection systems may be used to recover transformed cell lines. These 

9 include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 

10 phosphoribosyltransferase genes, for use in tk- or apr- cells, respectively (see e.g. Wigler, 

11 M. et al. (1977) "Transfer of purified herpes virus thymidine kinase gene to cultured 

12 mouse cells." Cell 1 1:223-32; Lowy, I. et al. (1980) "Isolation of transforming DNA: 

13 cloning the hamster aprt gene." Cell 22:817-23). Also antimetabolite, antibiotic, or 

14 herbicide resistance can be used as the basis for selection. For example, dhfr confers 

15 resistance to methotrexate,; neo confers resistance to the aminoglycosides, neomycin and 

16 G-418, and als and pat confer resistance to chlorsulfuron and phosphinotricin 

17 acetyltransferase, respectively, (see Wigler, M. et al. (1980) "Transformation of 

18 mammalian cells with an amplifiable dominant-acting gene." PNAS 77:3567-70; 

19 Colbere-Garapin, F. et al (1981) "A new dominant hybrid selective marker for higher 

20 eukaryotic cells." J. MoL Biol 150:1-14). Additional selectable genes have been 

21 described, e.g. trpB and hisD, which alter cellular requirements for metabolites. Visible 

22 markers, e.g. anthocyanins, green, red or blue fluorescent proteins (Clontech), B 

23 glucuronidase and its substrate B glucuronide, or luciferase and its substrate luciferin, 

24 may also be used. Resistant clones containing stably transformed cells may be 

25 propagated using tissue culture techniques appropriate to the cell type. 

26 Host cells transformed/transfected with nucleotide sequences encoding for the 

27 perturbagen or target of interest may be cultured under conditions suitable for the 

28 expression and recovery of the protein from cell culture. For example, the protein 

29 produced by a transformed transfected cell may be secreted when the selected expression 

30 vector incorporates signal sequences that direct secretion of the perturbagen through a 

31 prokaryotic or eukaryotic cell membrane. 
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1 Signal sequences also may be selected so as to direct the perturbagen to a 

2 particular intra-cellular compartment (Bradshaw, R.A. (1989) "Protein translocation and 

3 turnover in eukaryotic cells." Trends Biochem Sci 14(7):276-9). Perturbagen sequences 

4 may be isolated or purified from recombinant cell culture by methods heretofore 

5 employed for other proteins, e.g. native or reducing SDS gel electrophoresis, salt 

6 precipitation, isoelectric focusing, immobilized pH gradient electrophoresis, solvent 

7 fractionation, and chromatography such as ion exchange, gel filtration, immunoaffinity, 

8 and ligand affinity. 

9 H. Host Cells 

10 Host cell lines for use in the methodology described herein typically embody 

11 desirable traits such as 1) short cell cycle (i.e. 20-36 hr. doubling time), 2) amenability to 

12 high throughput procedures (e.g. FACS) without undue loss of membrane integrity or 

13 viability, 3) susceptibility to standard techniques designed to introduce reporter 

14 constructs and other forms of foreign DNA, and 4) exhibition of a readily selected 

15 phenotype (or its correlative marker gene expression). In this example, it is also 

16 advantageous if the host cell line chosen for perturbagen selection contains a physiology 

17 that is neither dependent upon, nor inordinately sensitive to, alterations in the [3-catenin - 

1 8 TCF4 pathway. For example, if a cell responds to one or more of the components of the 

19 bioassay (e.g. (3-catenin S45Y) or a perturbagen acting on said pathway of interest, by 

20 exhibiting a lower viability and/or capacity to compete in the population, for example, 

21 reducing said cell's growth rate, such a cell line would be less desirable than others that 

22 were insensitive to such changes. One non-limiting example of a satisfactory and 

23 acceptable cell line is BDEK 293. HEK 293 is highly susceptible to retroviral infection 

24 and other methods of introducing foreign genetic materials and can express/maintain said 

25 materials for long periods of time using a variety of selectable markers common to the 

26 field (e.g. neomycin, puromycin). In addition, previous studies that elucidated some of 

27 the key functions of (3-catenin were performed in HEK 293 cells, thus suggesting that the 

28 pathway can be altered in HEK293 cells without undue effects on cell viability (see, for 

29 example, Van Gassen, G. et al. (2000) Evidence that the beta catenin nuclear 

30 translocation assay allows for measuring presenilin 1 dysfunction." Mol Med 6(7): 570- 

31 80). 
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1 Reporter cell lines consist of a host cell that contains two components; i) a 

2 reporter gene operably-linked to a cis-acting promoter and ii) constitutive expression of a 

3 molecule that activates the TCF-(3-catenin pathway. The terms "operably-associated" 

4 and "operably-linked" refer to functionally related nucleic acids. A promoter is 

5 "operably-associated" or "operably-linked" with a coding sequence if the promoter 

6 controls or regulates the transcription of the gene to which it is linked. 

7 Favorable reporter lines exhibit several properties including 1) a large (>50-fold) 

8 "induced signal" to "uninduced signal" ratio, 2) a minimal number (<5%) of uninduced 

9 ("dim") cells when the population has been fully induced and, 3) a minimal number 

10 (<5%) of induced ("bright") cells when the populations is grown under non-inducing 

1 1 conditions. One example of an acceptable reporter cell line is a subline of HEK 293 

12 referred to herein as S4535 (also referred to as S45-35). In addition to containing an 

13 activated form of p-catenin (e.g. p-cat S45Y), clone S4535, contains an artificial or 

14 synthetic promoter made up of four tandem repeats of the TBE-2 sequence (5' 

15 GCTTTGATC), operably-linked in cis to the reporter gene, Green Fluorescent Protein 

16 (EGFP, Clontech, also, see, He, T.C. et al. (1998) "Identification of c-Myc as a target of 

17 the APC pathway." Science, 281: 1509-12). Though this arrangement of promoter 

18 sequences and reporter gene performs adequately in the bioassay described, alternative 

19 components can be substituted to create equally efficient reporter lines. For instance, the 

20 regulatory region of the reporter need not be limited to the TBE-2 element as variations 

21 in both the sequence and number of tandemly- aligned TBE-2 or TBE-2-related cassettes 

22 can be successfully employed to create alternative p-catenin-TCF reporters. Furthermore, 

23 naturally occurring promoters such as those that lie upstream of the coding sequences of 

24 c-Myc, MMP-7, Cyclin Dl and other genes that respond to activation of the (3-catenin- 

25 TCF pathway, can be fused to a variety of reporter genes (e.g. GFP, YFP, BFP, 

26 luciferase, p-galactosidase) to create functional reporter constructs (see, for example, 

27 Brabletz, T. et al. (1999) "beta-catenin regulates the expression of the matrix 

28 metalloproteinase-7 in human colorectal cancer." Am J Pathol 155(4): 1033-8). Thus, the 

29 size of the P-catenin-TCF-responsive fragment can vary considerably. Synthetic 

30 promoters, which often consist of variations of the P-catenin-TCF consensus binding 

31 sequence can be relatively small (e.g. 17-70 nucleotides) while naturally occurring P- 
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1 catenin-TCF promoters sequences can be considerably larger (e.g. >500 bp) depending 

2 upon the breakpoints used to isolate the element, 

3 The second component contained in the S4535 cell line is an activated form of p~ 

4 catenin called p-cat S45Y. In the P-cat S45Y allele, standard molecular techniques have 

5 been employed to substitute a tyrosine residue for serine at amino acid residue 45. As a 

6 result of this alteration, a phosphorylation site that is normally critical for regulation of 

7 free, intracellular p -catenin levels is removed, thus creating a molecule that has an 

8 extended half -life and is, consequently, more active. As the change in p-cat S45 Y does 

9 not alter the ability of the molecule to interact with TCF-4, addition of this mutant allele 

10 to HEK 293 cells leads to heightened expression of p-catenin-TCF-4 responsive genes. 

1 1 Thus, cells that have successfully incorporated both the p-cat S45 Y expression vector and 

12 the TBE-2 reporter construct constitutively express GFP and can be selected by FACS. It 

13 should be noted that alternatives to the p-cat S45Y allele can be employed. Thus 

14 mutations that convert amino acid residues 33 or 37 (both serine residues) to tyrosine or 

15 phenylalanine respectively, can be utilized to activate the pathway in question. 

16 Not all of the HEK293 cells that contain both the p-cat S45Y and the TBE-2 

17 reporter construct respond equally to activation or deactivation of the P-catenin-TCF 

18 signal pathway. In some instances, the reporter construct may exhibit a constitutive 

19 phenotype due to insertion of the retroviral vector containing the reporter into a 

20 chromosomal position that contains a strong, constitutive, enhancer or promoter in a 

21 nearby region. 

22 Other cells may exhibit little or no signal due to either i) insertion of the reporter 

23 element into a region of the genome that is transcriptionally silent or ii) introduction of a 

24 deleterious mutation in the p-cat S45Y during the process of transduction (the retroviral 

25 infection step is referred to herein as a transduction.) To eliminate these non-responsive 

26 cells and identify a clonal line that is readily modulated by perturbagens, a simple 

27 procedure is undertaken (Figure 3). In the first step, HEK 293 cells containing the 

28 reporter construct and the activated form of P-catenin undergo multiple cycles of FACS 

29 to isolate GFP + (bright) cells. In this fashion, it is assured that P-cat S45Y is intact and 

30 that the reporter construct has not been inserted into a region of the genome that is 
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1 transcriptionally silent. In the second step, individual clones that express high levels of 

2 GFP are obtained by plating cells at low density. After expanding these clones to a 

3 sufficient population, small samples of each are then transduced with a third retroviral 

4 construct containing a dominant negative allele of TCF4 called TCF4A30. The TCF4A30 

5 cDNA is a truncated form of TCF4 in which 30 amino acids responsible for the 

6 interaction with P-catenin have been removed from the N-terminus of the molecule. As a 

7 result of this deletion, TCF4A30 is capable of binding to the TCF4 DNA consensus 

8 binding site (e.g. the TBE2 sequence of the reporter) yet is unable to induce transcription 

9 due to the absence of an activation domain normally provided by p-catenin. Samples of 

10 each clone containing all three constructs are then analyzed by FACS to determine which 

11 of the clones are responsive to the dominant negative effects of TCF4A30. Cultures that 

12 respond to the presence of TCF4A30 by shifting the fluorescent intensity of the 

13 population from "bright" to "dim", represent clones in which both the reporter and the p~ 

14 cat S45Y allele of p-catenin have been inserted into chromosomal positions that permit 

15 modulation of the signal pathway. By returning to the original culture (e.g. +p-cat S45Y, 

16 + TBE2-reporter, -TCF4A30) from which these clones were derived, suitable reporter cell 

17 populations (e.g. S4535) can be expanded and utilized for subsequent perturbagen 

18 screens. 

19 As one familiar with the art is aware, there are several variations to the procedures 

20 described above that could be used to achieve the same end results. For instance, FACS 

21 could be replaced with antibody affinity chromatography methods (using cell surface 

22 localized reporter) to segregate responsive from non-responsive clones (see, for example, 

23 Larsson, P.H. et al. (1989) "Improved cell depletion in a panning technique using 

24 covalent binding of immunoglobulins to surface modified polystyrene dishes." J Immunol 

25 Methods. 1 16(2):293-8; Contractor, S.F. et al. (1988) "Human placental cells in culture: a 

26 panning technique using a trophoblast-specific monoclonal antibody for cell separation." 

27 JDev PhysiolAQ{\)'Al -51). Alternatively, the gates used to sort bright (or dim) cells in 

28 the FACS procedures could be adjusted to increase or decrease the enrichment 

29 procedures and thus alter the population of cells being collected and studied (see, for 

30 example Shapiro, H.M. (1995) "Practical Flow Cytometry" Wilely-Liss publishers). 

31 Furthermore, alleles other than p-catenin S45Y and TCFA30. 
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1 In addition, it is understood that there are many other suitable host cell lines 

2 including, but not limited to, transformed and/or immortalized cell lines derived from (i) 

3 HeLa (ATCC # CCL-2 ) and (ii) the CHO ( ATCC # CCL-61, see, for example Tetsu et 

4 ah (1999) "p-catenin regulates expression of cyclinDl in colon carcinoma cells Nature 

5 398 (6726) 422-426; Sadot et al (1998) , "Inhibition of P-catenin-mediated 

6 transactivation by cadherin derivatives." PNAS 95, 15339-15344). Any cell line such as 

7 these can be substituted as a host cell in the invention and can readily be screened to 

8 identify p-catenin-TCF reporter sublines. In addition, secondary cell lines may be used 

9 to study the effects of perturbagens. Thus, for instance, perturbagens isolated in HEK 

10 293 cells may be introduced into, for example, HT-29 or SW620 colorectal 

1 1 adenocarcinoma cells (ATCC # HTB-38 and CCL-227 respectively) to study the effects 

12 of the perturbagen on cell lines that have a well-documented dependence on the P- 

13 catenin-TCF-APC pathway and/or have natural mutations in said pathway. 

14 Lastly, critical components needed to construct a reporter cell line can be introduced into 

15 the cell by a variety of methodologies. For instance, episomal vectors carrying each of 

16 the components can be transformed into the cell type of choice and selected to identify 

17 rare events in which the vector becomes stably integrated into the genome. More 

18 preferably, a population of host cells containing the relevant constructs can be 

19 constructed using standard retroviral technology (Palu, G. et al. (2000) "Progress with 

20 retroviral gene vectors." Rev Med Virol 10(3): 185-202). 

21 I. Screening for Biological Activity 

22 The phenotypic assay described herein selects for perturbagens that modulate the 

23 p-catenin-TCF pathway. The procedures used to screen libraries for such perturbagens 

24 include: 1) introducing perturbagen encoding sequences (libraries) into clonal reporter 

25 cell lines containing both the activated form of p-catenin and the TBE-2 regulated 

26 reporter constructs; 2) growing said cells under the appropriate conditions necessary to 

27 identify perturbagens that repress expression of the reporter; 3) screening said cells by 

28 FACS or alternative high-throughput methods in order to segregate cells with the 

29 appropriate phenotype (e.g. "dim"); 4) re-isolating perturbagen encoding sequences from 

30 sorted cell populations by various techniques (e.g. PCR) and constructing new, retroviral 

31 sublibraries from the PCR product; 5) enriching for perturbagens by recycling said 
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1 sequences through the screen; and optionally 6) performing secondary assays to test 

2 specificity and scope of the agent. 

3 Various methods and instrumentation familiar to those who are skilled in the art 

4 are used to screen and test perturbagens. The media, supplements, and reagents used in 

5 culturing, packaging, and maintenance of HEK 293 cells, HS293gp packaging cell lines, 

6 and additional lines (e.g. HT29, SW620) can be purchased from a variety of commercial 

7 sources (Life Technologies, Clonetics, Cocalico Biologicals Inc., ATCC). It should be 

8 noted that although a particular set of procedures and media formulations are used in the 

9 work described herein, alternatives can be substituted with little or no effect. For 

10 instance, in most cases, retroviral packaging was accomplished using CaCl 2 . Though this 

11 is the preferred method of introducing retroviral vectors into 293gp packaging cells, 

12 alternative procedures such as LipofectAmine, may be used. Molecular techniques used 

13 in procedures such as genomic DNA isolation, PCR amplification, DNA endonculease 

14 digestion, ligation, cloning, and sequencing utilize common reagents that are supplied 

15 commercially (see, for example, Qiagen, New England BioLabs, Stratagene). 

16 Fluorescent activated cell sorting and analysis may be performed on a Coulter EPICS 

17 Elite Cell Sorter using EXPO software. Again, alternative reagents and equipment, such 

18 as the MoFlo R High-Speed Cell Sorter (Cytomation), are compatible with these 

19 procedures and may be substituted with little or no effect. 

20 To identify agents that deactivate the pathway, a retroviral expression library is 

21 introduced into Clone S4535 cells and grown/expanded over the course of several days in 

22 a selective environment (Figure 4). Under these conditions the vast majority of cells 

23 (>99%) do not contain a relevant perturbagen molecule and are "bright" due to the 

24 expression of high levels of the correlative reporter gene (e.g. red, green, or blue 

25 fluorescent protein). In contrast, a small fraction of the population appear "dim", a 

26 phenotype that results from, for instance, i) the presence of a perturbagen that inhibits or 

27 disrupts the P-catenin-TCF4~APC pathway or ii) loss of reporter expression due to cell 

28 death. "Dim" cells are then separated from the rest of the population by FACS and 

29 processed to re-isolate the perturbagen encoding sequences. Subsequently, these 

30 sequences are recycled through the bio-assay to further enrich for perturbagens that 

3 1 disrupt the pathway. 
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1 Several methods may be used to retrieve the perturbagen sequences from cells 

2 that have been sorted. For instance, perturbagen-encoding sequences may be recovered 

3 by PCR (see, for example, Schott, B. (1997) "Efficient recovery and regeneration of 

4 integrated retroviruses." Nucleic Acids Res. 25(14):2940-2). To accomplish this, 

5 genomic DNA (derived from cells taken from the FACS sorting procedures) is used as 

6 the template for PCR amplification. Complex mixtures with diversities of greater than 

7 50,000 can be amplified efficiently using oligonucleotide primers that flank the 

8 perturbagen encoding sequence. These sequences can subsequently be ligated into an 

9 appropriate retroviral vector, and introduced into a fresh population of, e.g., S4535 cells 

10 for additional rounds of screening and enrichment. Alternatively, retrieval of the 

1 1 perturbagen may be accomplished by reactivating the inserted retroviral vector that 

12 contains the perturbagen-encoding sequence. Specifically, host cells containing the 

13 perturbagen-encoding retrovirus are transformed with sequences that encode the 

14 necessary retroviral gag, pol and envelope proteins. As a result of these procedures, 

15 retroviral virions that contain the perturbagen-encoding sequences are released and can 

16 be isolated in the form of a viral supernatant. These supernatants can then be utilized to 

17 infect fresh populations of, e.g., S4535 cells to recycle the sequences through the screen 

1 8 for additional enrichment. 

19 Secondary cell lines may optionally be employed to test individual perturbagens 

20 for the ability to down-regulate P-catenin-TCF targets. For instance, perturbagens 

21 isolated in HEK 293 cells can be introduced into HT29 or SW610 colon cancer cells 

22 carrying a TBE-2-GFP reporter construct, and studied to determine whether the 

23 perturbagen' s action is confined to a single cell type (e.g. HEK 293) or is broader in its 

24 application. In addition, these same cell lines can be used to study whether the 

25 perturbagen induced down-regulation of the p-catenin-TCF-APC pathway stimulates 

26 secondary phenotypes (such as cell cycle arrest and/or apoptosis) in cells where the p~ 

27 catenin-TCF-APC pathway plays a more critical role. As one non-limiting example of 

28 how this might be accomplished, perturbagen sequences that induce a shift in the reporter 

29 expression levels could be introduced into (for instance) HT29 adenocarcinoma cells and 

30 tested for cytostatic or cytotoxic properties in a high throughput assay. In still another 

31 alternative, the effects of perturbagens on various aspects of tumor progression including 
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1 cell migration in colonic crypts, frequency of polyp formation, metastasis and other 

2 aspects colon cancer development, can be studied in whole animals by introducing 

3 perturbagen encoding sequences into either wild type or mutant mice carrying defects in 

4 one or more genes involved in the P-catenin-APC-TCF pathway, (e.g transgenic mice, 

5 see, for example, Sarao R, and Dumont DJ. (1998) "Conditional transgene expression in 

6 endothelial cells " Transgenic Res. 7(6):421-7; Wight D.C., Wagner T.E. (1994) 

7 "Transgenic mice: a decade of progress in technology and research." Mutat Res 

8 3G7(2):429-40; Edelmann W. et al. (1999) "Tumorigenesis in Mlhl and Mlhl/Apcl638N 

9 mutant mice." Cancer Res 59(6): 1301-7). 

10 J. Cellular targets 

11 In other embodiments, the invention encompasses the polypeptide, ribonucleotide, 

12 or polynucleotide sequence of the target (or fragment of each target) that is identified 

13 with each perturbagen agent, as well as the gene encoding each target and relevant 

14 fragments of said gene. 

15 Targets of specific perturbagens may be identified by several means. For 

16 instance, perturbagens can be modified with homo- or hetero- bifunctional coupling 

17 reagents and targets can be identified by chemical cross-linking techniques (see, for 

18 example, Tzeng, MC. et al. (1995) "Binding proteins on synaptic membranes for 

19 crotoxin and taipoxin, two phospholipases A2 with neurotoxicity." Toxicon. 33(4):451-7; 

20 Cochet, C. et al. (1988) "Demonstration of epidermal growth factor-induced receptor 

21 dimerization in living cells using a chemical covalent cross-linking agent." J Biol Chem. 

22 263(7):3290-5). Alternatively, one may use various techniques in column affinity 

23 chromatography, immunoprecipitation, or one of several high throughput peptide array 

24 platforms, to isolate peptides that react with the target of choice (see, for example, Hentz, 

25 N.G. and Daunert, S. (1996) "Bifunctional fusion proteins of calmodulin and protein A as 

26 affinity ligands in protein purification and in the study of protein-protein interactions." 

27 Anal Chem. 68(22):3939-44; Figeys D and Pinto D. (2001) "Proteomics on a chip: 

28 promising developments." Electrophoresis 22(2):208-16' Bichsel V.E. et al. "Cancer 

29 proteomics: from biomarker discovery to signal pathway profiling." Cancer J 7(1):69- 

30 78). In some instances, a particular phenotype may be the result of a perturbagen 

31 differentially regulating a distinct combination of genes. For example, a perturbagen 
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1 might, through its interaction with a particular transcription factor which, in turn, 

2 recognizes a particular DNA promoter sequence, elevate the expression of two or more 

3 target genes that act in concert to elicit a unique phenotype (e.g. viral resistance). In 

4 these cases, each of the genes whose levels of expression are altered by the perturbagen 

5 can be considered to be perturbagen targets. Such targets can be identified by a variety of 

6 techniques including (but not limited to) SAGE and expression profiling via microarray 

7 analysis (see, for instance, Cummings C A. and Relman D.A. (2000) "Using DNA 

8 Microarrays to Study Host-Microbe Interactions." Emerg Infect Dis. 6(5):5 13-525; 

9 Yamamoto M, et al. (2001) "Use of serial analysis of gene expression (SAGE) 

10 technology. J Immunol Methods. 2001 Apr;250(l-2):45-66), 

11 A preferred method of target identification involves application of variants of the 

12 standard two-hybrid technology. See, e.g., U.S.S.N. 09/193,759 and WO 00/29565 

13 "Methods for validating polypeptide targets that correlate to cellular phenotypes", the 

14 entire disclosures of which are incorporated by reference herein. Generally stated, the 

15 two-hybrid procedure is a quasi-genetic approach to detecting binding events. This assay 

16 often is performed in yeast cells (although it can be adapted for use in mammalian and 

17 bacterial cells), and relies upon constructing two vectors; the first having an interaction 

18 probe or bait (that in this case, will be the perturbagen) that typically is fused to a DNA 

19 binding domain ("BD") moiety, and a second vector having an interaction target or prey 

20 (a cDNA library) that is typically fused to a DNA transcriptional moiety (the "activation 

21 domain" or "AD"). Neither of the two fusion proteins can, individually, induce 

22 transcription of the reporter gene. Yet when the bait and prey interact, the AD and BD 

23 moieties are brought into sufficient physical proximity to result in transcription of a 

24 reporter gene (e.g., the His3 gene or lacZ gene) located downstream of the bound 

25 complex (Figure 5). Prey/bait interactions are then detected by identifying yeast cells 

26 that are expressing the reporter gene - e.g. which express lacZ or are able to grow in the 

27 absence of histidine. 

28 A variety of yeast host strains known in the art are suitable for use for identifying 

29 targets of individual perturbagens. One of ordinary skill will appreciate that a number of 

30 factors may be considered in selecting suitable host strains, including but not limited to 

31 (1) whether the host cells can be mated to cells of opposite mating type (i.e., they are 



46 



1 haploid), and (2) whether the host cells contain chromosomally integrated reporter 

2 constructs that can be used for selections or screens (e.g., His3 and LacZ), Although 

3 mating can be desirable in some embodiments, it is not strictly necessary for purposes of 

4 practicing the present invention. For example, the mating procedures can be eliminated 

5 by introducing the bait and prey constructs into a single yeast cell, whereupon the screens 

6 can be performed on the haploid cell. 

7 Generally, either Gal4 strains or LexA host strains may be used with the 

8 appropriate reporter constructs. Representative examples include strains yVT 69, yVT 

9 87, yVT96, yVT97, yVT98 and yVT99, yVTlOO, yVT360. Additionally, those of 

10 ordinary skill will appreciate that the host strains used in the present invention may be 

1 1 modified in other ways known to the art in order to optimize assay performance. For 

12 example, it may be desirable to modify the strains so that they contain alternative or 

13 additional reporter genes that respond to two-hybrid interactions. 

14 The following host yeast strains are thus constructed to have the indicated 

15 characteristics: 

16 YVT69: yVT69 (mat □, ura3-52, his3-200, ade2-101, trpl-901, leu2-3, 112, gal4A, 

17 met", gal80A, URA3::GALluAS-GALl T ATA-lacZ) was obtained from Clontech (Y187). 

18 YVT87: yVT87 (Mat-a ura3-52,his3-200,trpl-901 ,LexA op (x6 )-LEU2-3,112) was 

19 obtained from Clontech (EGY48), 

20 YVT96: The starting strain was YM4271 (Liu, J. et al., 1993) MATa, ura3-52 his3-200 

21 ade2-101 ade5 lys2-801 leu2-3, 112 trpl-901 tyrl-501 gal4A gal80A ade5::hisG. 

22 YM4271 was converted to yVT96, MATa ura3-52 his3-200 ade 2-101 ade5 lys2::GAL2- 

23 URA3 leu2-3, 1 12 trpl-901 tyrl-501 ga!4D gal80A ade5::hisG by homologous 

24 recombination of Reporter 1 to the LYS2 locus. The integration is confirmed by PCR. 

25 YVT97: The starting strain is YM4271 (Liu, J. et al., 1993) MATa, ura3-52 his3-200 

26 ade2-101 ade5 lys2-801 leu2-3, 112 trpl-901 tyrl-501 gal4A gal80A ade5::hisG. 

27 YM4271 will be converted to yVT97, MATa ura3-52 his3::GALl or GAL7-HIS3 ade2- 

28 101 ade5 lys2-801 leu2-3, 1 12 trpl-901 tyrl-501 gal4A gal80A ade5::hisG by the steps of 

29 (a) converting from MATa to MATa via transient expression of the HO endonuclease, 

30 Methods in Enzymology Vol. 194: 132-146 (1991) and (b) integrating either of Reporters 



47 



1 3 or 4 at the HIS3 locus via homologous recombination. The integration is confirmed by 

2 PCR. 

3 YVT98; The starting strain was EGY48 (Estojak, J. Et al., 1995) MATa, ura3 his3 trpl 

4 leu2::LexAop(x6)-LEU2. EGY48 was converted to strain yVT98 MATa ura3 his3 trpl 

5 leu2::lexAop(x6)-LEU2 lys2::lexAop(8x or 2x)-LacZ by homologous recombination of 

6 Reporter 6 into the LYS2 locus. 

7 YVT99: The starting strain was EGY48 (Estojak, J. Et al., 1995) MATa, ura3 his3 trpl 

8 leu2::LexAop(x6)-LEU2. EGY48 was converted to strain yVT99 MATa ura3 his3 trpl 

9 leu2::lexAop(x6)-LEU2 lys2::lexAop(8x or 2x)-URA3 by homologous recombination of 

10 Reporter 2 into the LYS2 locus and by switching the mating type from MATa to MATa 

1 1 via transient expression of the HO endonuclease. 

12 YVT100: The starting strain was YM4271 (Liu, J. et al., 1993) MATa, ura3-52 his3-200 

13 ade2-101 ade5 lys2-801 leu2-3, 112 trpl-901 tyrl-501 gal4A gal80A ade5::hisG. 

14 YM4271 was converted to yVTlOO, MATa ura3-52 his3-200 ade2-101 ade5 

15 lys2::lexAop(8x or 2x>URA3 leu2-3, 1 12 trpl-901 tyr-501 gal4A gal80A ade5::hisG by 

16 homologous recombination of Reporter 2 to the LYS2 locus. The integration was 

1 7 confirmed by PCR. 

18 YVT360: yVT360 (mat a, trpl-901, leu2-3,l 12, ura3-52, his3-200, gal4 A, gal 80A, 

19 LYS2::GAL1 UA s-GAL1tata-HIS3, GAL2 uas -GAL2 T ata-ADE2, URA3:MEL1 uas - 

20 MELlrATA-lacZ) was obtained from Clontech (AH109). 

21 Exemplary yeast-reporter strains are constructed using a variety of standard 

22 techniques. Many of the starting yeast strains already carry multiple mutations that lead 

23 to an auxotrophic phenotype (e.g. ura3-52, ade2-101). When necessary, reporter 

24 constructs can be integrated into the genome of the appropriate strain by homologous 

25 recombination. Successful integration can be confirmed by PCR. Alternatively, 

26 reporters may be maintained in the cells episomally. 

27 The yeast two-hybrid reporter gene typically is fused to an upstream promoter 

28 region that is recognized by the BD, and is selected to provide a marker that facilitates 

29 screening. Examples include the lacZ gene fused to the Gall promoter region and the 

30 His3 yeast gene fused to Gall promoter region. A variety of yeast two-hybrid reporter 

3 1 constructs are suitable for use in the present invention. One of ordinary skill will 
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1 appreciate that a number of factors may be considered in selecting suitable reporters, 

2 including whether (1) the reporter construct provides a rigorous selection (i.e., yeast cells 

3 die in the absence of a protein-protein or peptide-protein interaction between the bait and 

4 prey sequences), and/or (2) the reporter construct provides a convenient screen (e.g., the 

5 cells turn color when they harbor bait and prey sequences that interact). Examples of 

6 desirable reporters include (1) the Urdi gene, which confers growth in the absence of 

7 uracil and death in the presence of 5-fluoroorotic acid (5-FOA); (2) the His3 gene, which 

8 permits growth in the absence of histidine; (3) the LacZ gene, which is monitored by a 

9 colorimetric assay in the presence/absence of beta-galactosidase substrates (e.g. X-gal); 

10 (4) the Leu2 gene, which confers growth in the absence of leucine; and (5) the Lys2 gene, 

1 1 which confers growth in the absence of lysine or, in the alternative, death in the presence 

12 of a-aminoadipic acid. These reporter genes may be placed under the transcriptional 

13 control of any one of a number of suitable cis-regulatory elements, including for example 

14 the Gal2 promoter, the Gall promoter, the Gal7 promoter, or the LexA operator 

15 sequences. 

16 The following are exemplary, non-limiting examples of such reporter constructs. 

17 Reporter 1 - (pVT85): This reporter comprises the URA3 gene under the transcriptional 

18 control of the yeast Gall upstream activating sequence (UAS). In order to facilitate 

1 9 integration of this reporter into the yeast chromosome in place of the Lysl coding region, 

20 the Gal2-Ura3 construct is flanked on the 5' side by the 500 base pairs that he 

21 immediately upstream of the coding region of the LYS2 gene and on the 3' side by the 

22 500 base pairs that lie immediately 3' of the coding region of the LYS2 gene. The entire 

23 vector is also cloned into the yeast centromere containing vector pRS413 (Sikorski, RS 

24 and Hieter, P., Genetics 122(1): 19-27 (1989) and can therefore be used episomally. This 

25 reporter is intended for use with a Ga/4-based two-hybrid system, e.g., Fields, S. and 

26 Song, O., Nature 340:245-246 (1989). 

27 Reporter 2 - (pVT86): This reporter is identical to reporter #1 except that the GAL2 

28 UAS sequences have been replaced with regulatory promoter sequences that contain 

29 eight LexA operator sequences (Ebina et al., 1983). The number of LexA operator 

30 sequences in this reporter may either be increased or decreased in order to obtain the 



49 



1 appreciate that a number of factors may be considered in selecting suitable reporters, 

2 including whether (1) the reporter construct provides a rigorous selection (i.e., yeast cells 

3 die in the absence of a protein-protein or peptide-protein interaction between the bait and 

4 prey sequences), and/or (2) the reporter construct provides a convenient screen (e.g., the 

5 cells turn color when they harbor bait and prey sequences that interact). Examples of 

6 desirable reporters include (1) the Urai gene, which confers growth in the absence of 

7 uracil and death in the presence of 5-fluoroorotic acid (5-FOA); (2) the His3 gene, which 

8 permits growth in the absence of histidine; (3) the LacZ gene, which is monitored by a 

9 colorimetric assay in the presence/absence of beta-galactosidase substrates (e.g. X-gal); 

10 (4) the Leu2 gene, which confers growth in the absence of leucine; and (5) the Lys2 gene, 

1 1 which confers growth in the absence of lysine or, in the alternative, death in the presence 

12 of ot-aminoadipic acid. These reporter genes may be placed under the transcriptional 

13 control of any one of a number of suitable cis-regulatory elements, including for example 

14 the Gal2 promoter, the Gall promoter, the Gall promoter, or the LexA operator 

15 sequences. 

16 The following are exemplary, non-limiting examples of such reporter constructs. 

17 Reporter 1 - (pVT85): This reporter comprises the URA3 gene under the transcriptional 

18 control of the yeast Gall upstream activating sequence (UAS). In order to facilitate 

19 integration of this reporter into the yeast chromosome in place of the Lysl coding region, 

20 the Gall-Urdh construct is flanked on the 5 5 side by the 500 base pairs that lie 

21 immediately upstream of the coding region of the LYS2 gene and on the V side by the 

22 500 base pairs that lie immediately 3' of the coding region of the LYS2 gene. The entire 

23 vector is also cloned into the yeast centromere containing vector pRS413 (Sikorski, RS 

24 and Hieter, P., Genetics 122(1): 19-27 (1989) and can therefore be used episomally. This 

25 reporter is intended for use with a Ga/4-based two-hybrid system, e.g., Fields, S. and 

26 Song, O., Nature 340:245-246 (1989). 

27 Reporter 2 - (pVT86); This reporter is identical to reporter #1 except that the GAL2 

28 UAS sequences have been replaced with regulatory promoter sequences that contain 

29 eight LexA operator sequences (Ebina et aL, 1983). The number of LexA operator 

30 sequences in this reporter may either be increased or decreased in order to obtain the 
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1 optimal level of transcriptional regulation. This reporter is intended to be used within the 

2 general confines of the LexA-based interaction trap devised by Brent and Ptashne- 

3 Reporter 3 - (pVT87): This reporter is comprised of the yeast His3 gene under the 

4 transcriptional control of the yeast Gall upstream activating sequence (UAS). In order to 

5 facilitate integration of this reporter into the yeast chromosome in place of the His3 

6 coding region the Gall-His3 construct is flanked on the 5 1 side by the 500 base pairs (bp) 

7 immediately upstream of the His3 coding region and on the 3 ! side by the 500 bp 

8 immediately 3' of the His3 coding region. The entire reporter is also cloned into the yeast 

9 centromere containing vector pRS415 and can therefore be used episomally. This 

10 reporter is intended for use with a GaW-based two-hybrid system. 

1 1 Reporter 4 - (pVT88): This reporter is identical to Reporter 3 except that the His3 gene 

12 is under the transcriptional control of Gal7 UAS sequences rather than the Gall UAS. 

13 The reporter is used with a GaW-based two-hybrid system. 

14 Reporter 5 - (pVT89): This reporter contains the bacterial LacZ gene under the 

15 transcriptional control of the Gall UAS. The entire reporter will be cloned into a yeast 

16 centromere-using vector, e.g., pRS413, and is used episomally. 

17 Reporter 6 - (pVT90): This reporter consists of the LacZ gene under the transcriptional 

18 control of eight LexA operator sequences. As for Reporter 2, the number of LexA operator 

19 sequences in this reporter may either be increased or decreased in order to obtain optimal 

20 levels of transcriptional regulation. Two features of this reporter facilitate integration of 

21 the reporter into the yeast chromosome in place of the Lys2 coding region. First, it is 

22 flanked on the 5' side by the 500 base pairs that lie immediately upstream of the coding 

23 region of the Lys2 gene and on the 3' side by the 500 base pairs that lie immediately 3' of 

24 the coding region of the Lys2 gene. Second, the neomycin (NEO) resistance gene has 

25 been inserted between the 5 f Lys2 sequences and the LexA promoter sequences. This 

26 reporter is used in conjunction with a LexA-based interaction trap, e.g. , Golemis, E.A., et 

27 al., (1996), "Interaction trap/two hybrid system to identify interacting proteins." Current 

28 Protocols in Molecular Biology, Ausebel et al., eds., New York, John Wiley & Sons, 

29 Chap. 20.1.1-20.1.28. 

30 In other embodiments, perturbagen-induced phenotypes may be the result of 

31 RNA-RNA, RNA-polypeptide, polypeptide-DNA, or RNA-DNA interactions. In cases 
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1 such as these, variations of the original two-hybrid theme may be applied to identify the 

2 target of the phenotypic probe. (See, for example, Li, J J. and Herskowitz, L (1993) 

3 Isolation of Orc6, a Component of the Yeast Origin Recognition Complex by a One- 

4 Hybrid System. Science, 262:1870-1874; Svinarchuk, F. et aL (1997) "Recruitent of 

5 transcription factors to the target site by triplex-forming oligonucleotides." NAR 25: 

6 3459-3464; Segupta, DJ. et al. (1999) "Identification of RNAs that bind to a specific 

7 protein using the yeast three-hybrid system." RNA 5:596-601; Harada, K. et al. (1996) 

8 "Selection of RNA-binding peptides in vivo." Nature 14;380(6570): 175-9; SenGupta, 

9 DJ. et al. (1996) "A three-hybrid system to detect RNA protein interactions in vivo." 

10 PNAS 93:8496-8501). For instance, if evidence exists that a perturbagen is acting as an 

1 1 anti-sense agent, it is necessary to construct a system where the association of the DNA 

12 binding domains and the transcriptional activation domains is dependent upon and RNA- 

13 RNA interaction. To accomplish such a screen, four unique vectors are created (Figure 

14 6). The first vector consists of the DNABP (e.g. GAL4 BD) described previously, linked 

15 to a specific RNA binding protein, arbitrarily called "RNABP-A" (e.g. the Rev 

16 responsive element RNA binding protein, RevMlO, see Putz, U. et al. (1996) "A tri- 

17 hybrid system for the analysis and detection of RNA-protein interactions." NAR 24:4838- 

18 4840). Vector #2 contains the transcriptional activation domain (e.g. GAL4 AD) linked 

19 to a second RNA binding protein ("RNABP-B", e.g. the MS 2 coat protein of the MS2 

20 bacteriophage, see for example, SenGupta, DJ. et al. (1996) "A three hybrid system to 

21 detect RNA-protein interactions in vivo." PNAS 93:8496-8501). The third vector encodes 

22 an RNA molecule that is recognized by RNABP-A (e.g. the RRE sequence, Zapp, M.L. 

23 and Green M.R/ "Sequence-specific RNA binding by the HIV-1 Rev protein (1989) 

24 Nature, 32:714-716) fused to a sequence encoding the RNA perturbagen, while the final 

25 vector encodes a fourth hybrid, the RNA sequence recognized by RNABP-B (e.g. the 21 

26 base nucleotide RNA stem-loop structure of MS2, see Uhlenbeck, O.C. et. al. (1983) 

27 "Interaction of R17 coat protein with its RNA binding site for translational repression." J. 

28 Biomol Struct. Dyn. 1, 539-552) linked to a library of expressed sequences (e.g. a library 

29 of mRNA molecules). When all four vectors are stably maintained in a yeast cell 

30 containing the necessary reporter construct(s) (e.g. Pgau-LACZ), the cellular target RNA 

31 molecule of any given RNA perturbagen can be identified. 
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1 Target sequences or fragments thereof can vary greatly in size. Some target fragments 

2 can be as small as ten amino acids in length. Alternatively, target sequences can be 

3 greater than 10 amino acids but less than thirty amino acids in length. Still other targets 

4 can be greater than thirty amino acids in length but shorter than 60 amino acids in length. 

5 Still other targets are cellular proteins or subunits or domains therein of more than 60 

6 amino acids in length. Still other targets are cellular proteins or subunits or domains 

7 there of more than 60 amino acids in length. Still other targets are cellular proteins or 

8 subunits or domains there of more than 60 amino acids in length. In addition, for reasons 

9 described previously, the sequences encoding targets can vary greatly due to allelic 

10 variation, duplications and closely related gene family members. That said, the invention 

1 1 also encompasses variants of said targets. A preferred target variant is one which has at 

12 least about 80%, alternatively at least about 90%, and in another alternative at least about 

13 95% amino acid sequence identity to the original target amino acid sequence and which 

14 contains at least one functional or structural characteristic of the original target. 

15 K. Databases 

16 The compositions, relations and phenotypic effects yielded by the methodology 

17 described herein may advantageously be placed into or stored in a variety of databases. 

18 As one example, a database may include information about one or more targets identified 

19 by the methods herein, including for example sequence information, motif information, 

20 structural information and/or homology information. The database may optionally 

21 contain such information regarding perturbagen agents, and may correlate the 

22 perturbagen information to corresponding target information. Further helpful database 

23 aspects may include information regarding, e.g., variants or fragments of the above. The 

24 database may also correlate the indexed compounds to, e.g., immunoprecipitation data, 

25 further yeast n-hybrid interaction data, genotypic data (e.g., identification of disrupted 

26 genes or gene variants), and with a variety phenotypic data. Such databases are 

27 preferably electronic, and may additionally be combined with a search tool so that the 

28 database is searchable. 

29 L. Production of antibodies 

30 An additional embodiment of the invention includes antibodies that recognize the 

3 1 perturbagen itself, cellular targets of the perturbagen, or one or more epitopes of the 
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1 foregoing. Such reagents may include, but are not limited to, polyclonal, monoclonal, 

2 humanized, chimeric, and single chain antibodies, Fab fragments, F(ab')2 fragments, 

3 fragments produced by a Fab expression library, anti-idiotypic (anti-Id) antibodies, and 

4 epitope-binding fragements of any of the above. Antibodies directed against 

5 perturbagens or cellular targets may be useful for a variety of purposes including i) 

6 therapeutics, ii) diagnostic assays, iii) cytoimmunology, iv) target identification, and v) 

7 purification. 

8 For the production of antibodies, various hosts including goats, rabbits, rats, mice, 

9 humans and others may be immunized by injection with a perturbagen, target or any 

10 fragment thereof which has immunogenic properties. Depending on the host species, 

1 1 various adjuvants may be used to increase immunological response. Such adjuvants 

12 include, but are not limited to Freund's (complete and incomplete), mineral gels such as 

13 aluminum hydroxide, and surface-active substances such as lysolecithin, pluronic 

14 polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants 

15 used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are 

16 especially preferable. 

17 Polyclonal antibodies are heterogeneous populations of antibody molecules 

18 derived from the sera of animals immunized with an antigen, such as a given perturbagen, 

19 target, or an antigenic functional derivative thereof. For the production of polyclonal 

20 antibodies, host animals such as those described above, may be immunized by injection 

21 with gene product supplemented with adjuvants as also described above. 

22 Monoclonal antibodies that recognize perturbagens may be prepared using any technique 

23 that provides for the production of antibody molecules by continuous cell lines in culture. 

24 These include, but are not limited to, the hybridoma technique, the human B-cell 

25 hybridoma technique, and the EBV hybridoma technique, (see, for example, Kohler, G. et 

26 al. (1975) "Continuous cultures of fused cells secreting antibody of predefined 

27 specificity." Nature 256:495-497; Kozbor, D. et al (1985) "Specific immunoglobulin 

28 production and enhanced tumorigenicity following ascites growth of human 

29 hybridomas." J. Immunol. Methods 81:31-42; Cote, R J. et al. (1983) PNAS 80:2026- 

30 2030; and Cole, S.P. et al. (1984) "Generation of human monoclonal antibodies reactive 

3 1 with cellular antigens" Mol Cell Biol 62: 109-120). 



53 



1 In addition, one may use techniques developed for the production of chimeric 

2 antibodies, such as the splicing of mouse antibody genes to human antibody genes to 

3 obtain a molecule with appropriate antigen specificity and biological activity. See, e.g., 

4 Morrison, S.L. et al. (1984) "Chimeric human antibody molecules: mouse antigen- 

5 binding domains with human constant region domains," PNAS 81:6851-6855); 

6 Neuberger, M.S. et al. (1984) "Recombinant antibodies possessing novel effector 

7 functions" Nature 312:604-608; and Takeda, S. et al. (1985) "Construction of chimeric 

8 processed immunoglobulin genes containing mouse variable and human constant region 

9 sequences." Nature 314:452-454). Alternatively, techniques described for the production 

10 of single chain antibodies may be adapted, using methods known in the art, to produce 

1 1 perturbagen-specific antibodies (see, e.g. Burton, D.R. (1991) "A large array of human 

12 monoclonal antibodies to type 1 human immunodeficiency virus from combinatorial 

13 libraries of asymptomatic seropositive individuals." PNAS 88: 10134-10137). 

14 Antibodies may also be produced by inducing in vivo production in the lymphocyte 

15 population or by screening immunoglobulin libraries or panels of highly specific binding 

16 reagents as disclosed in the literature, (see, for example, Orlandi, R. et al. (1989) 

17 "Cloning immunoglobulin variable domains for expression by the polymerase chain 

18 reaction." PNAS 86:3833-3837; Winter, G. et al (1991) "Man-made antibodies." Nature 

19 349: 293-299). 

20 Antibody fragments that contain specific binding sites for perturbagens may also 

21 be generated. For example, such fragments include, but are not limited to F(ab r )2 

22 fragments produced by pepsin digesting of the antibody molecule and Fab fragments 

23 generated by reducing the disulfide bridges of the F(ab') 2 fragments. Alternatively, Fab 

24 expression libraries may be constructed to allow rapid and easy identification of 

25 monclonal Fab fragments with the desired specificity. (See, for example, Huse, W.D. et 

26 al. (1989) "Generation of a large combinatorial library of the immunoglobulin repertoire 

27 in phage lambda." Science 246: 1275-1281). 

28 M. Screening Assays 

29 The agents of the invention can be used to screen for drugs or compounds (small 

30 molecules) that mimic, or modulate the activity or expression of said phenotypic probes. 

3 1 The present invention may be employed in a process for screening for agents such as 
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1 agonists, ie. agents that bind to and activate a (3-catenin/Tcf pathway target, or 

2 antagonists, le, inhibit the activity or interaction of an p-catenin/Tcf pathway target with 

3 an endogenous or exogenous ligand. Thus, polypeptides of the invention may also be 

4 used to assess the binding of small molecule substrates and ligands in, for example, cells, 

5 cell-free preparations, chemical libraries, and natural product mixtures as known in the 

6 art. Any methods routinely used to identify and screen for agents that can modulate 

7 receptors may be used in accordance with the present invention. 

8 Like the perturbagen itself, such small molecule compounds may be used to treat 

9 disorders characterized by insufficient or excessive production of a target which has 

10 decreased or aberrant activity compared to the wild type entity. Thus, the invention 

1 1 provides a method for identifying modulators, i.e. candidate or test compounds or agents 

12 (e.g. peptidomimetics, small molecules or other drugs) that bind to the agent or its target, 

13 and have a stimulatory or inhibitory effect on the pathway(s) affected by said agent. 

14 In vitro systems may be designed to identify compounds capable of binding, e.g., a (3- 

15 catenin pathway target gene product. Such compounds may include, but are not limited 

16 to, peptides made of D-and/or L-configuration amino acids (in, for example, the form of 

17 random peptide libraries; (see e.g., Lam, et ah, Nature, 354:82-4 (1991)), 

18 phosphopeptides (in, for example, the form of random or partially degenerate, directed 

19 phosphopeptide libraries; see, e.g., Songyang, et ah, Cell, 72:767-78 (1993)), antibodies, 

20 and small organic or inorganic molecules. Compounds identified may be useful, for 

21 example, in modulating the activity of p-catenin pathway target gene proteins, preferably 

22 mutant proteins; elaborating the biological function of the p-catenin pathway target gene 

23 protein; or screening for compounds that disrupt normal p-catenin pathway target gene 

24 interactions or themselves disrupt such interactions. 

25 In one embodiment, the invention provides libraries of test compounds. The test 

26 compounds of the present invention can be obtained using any of the numerous 

27 approaches in combinatorial library methods known in the art, including: biological 

28 libraries, spatially addressable parallel solid phase or solution phase libraries; synthetic 

29 library methods requiring deconvolution; the one-bead one-compound library method; 

30 and synthetic library methods using affinity chromatography selection. The biological 

31 library approach is exemplified by peptide libraries, while the other four approaches are 
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1 applicable to peptide, non-peptide oligomer or small molecule libraries of compounds 

2 (Lam, K.S. (1997) "Application of combinatorial library methods in cancer research and 

3 drug discovery." Anticancer Drug Des. 12: 145). 

4 Methods for the synthesis of molecular libraries can be found in the art, for example, in 

5 (i) De Witt, S.H. et al. (1993) "Diversomers: an approach to nonpeptide, nonoligomeric 

6 chemical diversity." PNAS 90:6909, (ii) Erb, R et al. (1994) "Recursive deconvolution 

7 of combinatorial chemical libraries ."PNAS 91:11422, (iii) Zuckermann, R.N. et al. 

8 (1994) "Discovery of nanomolar ligands for 7-transmembrane G-protein-coupled 

9 receptors from a diverse N-(substituted)glycine peptoid library." Med Chent. 37: 2678 

10 and (iv) Cho, C.Y. et al. (1993) "An unnatural biopolymer." Science 261: 1303. Libraries 

11 of compounds may be presented in i) solution (e.g. Houghten, R.A. (1992) "The use of 

12 synthetic peptide combinatorial libraries for the identification of bioactive peptides " 

13 BioTechniques 13:412) ii) on beads (Lam, K.S. (1991) "A new type of synthetic peptide 

14 library for identifying ligand-binding activity." Nature 354:82), iii) chips (Fodor, S.P. 

15 (1993) "Multiplexed biochemical assays with biological chips." Nature 364:555), iv) 

16 bacteria (U.S. Patent # 5,223,409), v) spores (Patent Nos 5,571,698, 5,403,484, and 

17 5,223,409), vi) plasmids (Cull, M.G. et al. (1992) "Screening for receptor ligands using 

18 large libraries of peptides linked to the C terminus of the lac repressor." PNAS 89: 1865) 

19 or vii) phage (Scott, J.K. and Smith, G.P. (1990) "Searching for peptide ligands with an 

20 epitope library." Science 249: 386) 

21 There are several methods for identifying small molecule compounds that mimic 

22 the action of the phenotypic probes. In one approach, an assay may be devised to directly 

23 identify agents that bind to, e.g., a Pcatenin/TCF pathway target protein. Such direct 

24 binding assays generally involve preparing a reaction mixture of the Pcatenin/TCF 

25 pathway target protein and the test compound under conditions and for a time sufficient 

26 to allow the two components to interact and bind, thus forming a complex that can be 

27 removed and/or detected in the reaction mixture. These assays can be conducted in a 

28 variety of ways. For example, one method to conduct such an assay would involve 

29 anchoring the Pcatenin/TCF pathway target protein or the test substance onto a solid 

30 phase and detecting target protein/test substance complexes anchored on the solid phase 

31 at the end of the reaction. In one embodiment of such a method, the Pcatenin/TCF 
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1 pathway target protein may be anchored onto a solid surface, and the test compound, 

2 which is not anchored, may be labeled, either directly or indirectly. 

3 In practice, microtitre plates are conveniently utilized. The anchored component 

4 may be immobilized by non-covalent or covalent attachments. Non-covalent attachment 

5 may be accomplished simply by coating the solid surface with a solution of the protein 

6 and drying. Alternatively, an immobilized antibody, preferably a monoclonal antibody, 

7 specific for the protein may be used to anchor the protein to the solid surface. The 

8 surfaces may be prepared in advance and stored. 

9 In order to conduct the assay, the nonimmobilized component is added to the 

10 coated surface containing the anchored component. After the reaction is complete, 

1 1 unreacted components are removed (e.g. , by washing) under conditions such that any 

12 complexes formed will remain immobilized on the solid surface. The detection of 

13 complexes anchored on the solid surface can be accomplished in a number of ways. 

14 Where the previously nonimmobilized component is pre-labeled, the detection of label 

15 immobilized on the surface indicates that complexes were formed. Where the previously 

16 nonimmobilized component is not pre-labeled, an indirect label can be used to detect 

17 complexes anchored on the surface; e.g., using a labeled antibody specific for the 

18 previously nonimmobilized component (the antibody, in turn, may be directly labeled or 

19 indirectly labeled with a labeled anti-Ig antibody). 

20 Alternatively, a reaction can be conducted in a liquid phase, the reaction products 

21 separated from unreacted components, and complexes detected; e.g., using an 

22 immobilized antibody specific for a Pcatenin/TCF pathway gene product or the test 

23 compound to anchor any complexes formed in solution, and a labeled antibody specific 

24 for the other component of the possible complex to detect anchored complexes. 

25 Compounds that are shown to bind to a particular Pcatenin/TCF pathway gene product 

26 through one of the methods described above can be further tested for their ability to elicit 

27 a biochemical response from the a Pcatenin/TCF pathway gene protein. Agonists, 

28 antagonists and/or inhibitors of the expression product can be identified utilizing assays 

29 well known in the art. 

30 In another approach, perturbagen/target pairs are used to identify small molecule 

31 mimetics in a displacement assay format. Such assays can be based upon a variety of 
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1 technologies including, but not limited to i) ELISAs (see, for example, Rice, J.W. et al- 

2 (1996) "Development of a high volume screen to identify inhibitors of endothelial cell 

3 activation." Anal Biochem 241(2):254~9), ii) scintillation proximity assays (see, for 

4 example, Lerner, C.G. and Saiki, A.Y.C. (1996) "Scintillation proximity assay for human 

5 DNA topoisomerase I using recombinant biotinyl-fusion protein produced in baculovirus- 

6 infected insect cells." Anal Biochem 240(2): 185-96), or iii) time-resolved fluorescence 

7 resonance energy transfer-based technology (see, for example, Fernandes, P.B. (1998) 

8 "Technological advances in high-throughput screening." Curr Opin Chem Biol 2(5): 597- 

9 603; Hemmila, "Time-resolved fluorometry - advantages and potentials in high 

10 throughput screening assays." "High Throughput Screening'", J. Devlin (ed.). Marcel 

1 1 Dekker Inc, New York, pp. 361-76 (1997)). Two non-limiting examples of such assays, 

12 one homogeneous, LANCE™ (Stenroos, K. et al. (1997) "Homogeneous time resolved 

13 fluorescence energy transfer assay (LANCE) for the determination of IL-2-IL-2 receptor 

14 interaction." Abstract of Papers Presented at the 3rd Annual Conference of the Society 

15 for Biomolecular Screening, Sep., California), and one heterogeneous, DELFIA™ 

16 (MacGregor, L et al. (1999) "Application of a time-resolved fluoroimmunoassay for the 

17 analysis of normal prion protein in human blood and its components." Vox Sang 

18 77(2):88-96; Jensen, P.E. et al. (1998) "A europium fluoroimmunoassay for measuring 

19 peptide binding to MHC class I molecules." J. Immunol Methods 215: 71-80; Takeuchi, 

20 T. et al. (1995) "Nonisotopic receptor assay for benzodiazepine drugs using time- 

21 resolved fluorometry." Anal Chem. 67: 2655-8) are described as follows. 

22 1. Lance™: Homogeneous Assay 

23 To identify small molecules capable of disrupting the interaction between the 

24 perturbagen and its target, assays are designed to utilize the LANCE™ technology 

25 (commercially available from E. G. & G. Wallac). LANCE™ is a homogeneous assay 

26 that is performed in solution and requires no wash steps to separate bound and unbound 

27 label. Briefly, the target is produced in large quantities and labeled with a lanthanide 

28 chelate (i.e. a fluorescent donor such as a Europium, (Eu) or Terbium (Tb) chelate). 

29 Concomitantly, the perturbagen is labeled with one of several fluorescent "acceptor" 

30 moieties that can be excited by the emissions of the donor molecule (e.g. allophycocyanin 

31 (APC) or rhodamine Rh, respectively). Most preferably, 1) the modification of either the 
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1 perturbagen or the target is not detrimental to the interaction between the two interacting 

2 molecules being studied and 2) the distance separating the donor and acceptor moieties 

3 when the perturbagen and the target are associated, is sufficiently close to permit FRET 

4 (typically 30-100 Angstroms). As an alternative to direct labeling of the perturbagen, 

5 monoclonal antibodies directed against the perturbagen can be labeled with Eu, thus 

6 allowing small molecule displacement assays to take place via indirect labeling 

7 procedures. 

8 To identify small molecules capable of disrupting the interaction between the 

9 perturbagen and its target, the two labeled components are alliquoted into wells (1536 

10 well format) at previously set, optimized conditions that will ensure 50% binding (Figure 

11 7). Subsequently, each well is then exposed to one or more members of a large chemical 

12 combinatorial library and time-resolved measurements are taken using a Wallac 1420 

13 Victor multilabel counter or equivalent fluoremeter. In wells that contain a small 

14 molecule that interferes with the interaction between the perturbagen and its target, the 

15 distance separating the donor and acceptor molecules is increased. As a result of this 

16 dissociation or displacement, the ability of the Eu emissions to excite the acceptor is 

17 compromised and the total fluorescence emitted by the acceptor is decreased. 

18 2. DELFIA™: Heterogeneous Assay 

19 Several variations of a heterogeneous assay (DELFIA™) using an immobilized 

20 substrate can be used as an alternative to LANCE™. In one non-limiting example, the 

21 target is immobilized to a solid support using a monoclonal antibody that has been 

22 labeled with Eu (Figure 8). Subsequent addition and binding of a rhodamine labeled 

23 perturbagen in the presence or absence of a candidate small organic displacement 

24 molecule is followed by several wash steps to remove unbound material. TR-FRET is 

25 then performed by exciting Eu and measuring the levels of Rh emissions. As an 

26 alternative to this procedure, the target is immobilized to the solid support using an 

27 unlabeled monoclonal antibody. Subsequently, an Eu-labeled perturbagen (+/- a 

28 candidate small organic displacement molecule) is added to each well and allowed to 

29 equilibrate, followed by a washing procedure to eliminate unbound Eu-labeled material. 

30 Once the well has been cleared of all unbound material, the bound Eu-perturbagen 

3 1 molecules are released and excited in the presence of commercially available 
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1 enhancement solutions (DELHA Enhancement Solutions, Wallac). By comparing the 

2 levels of emissions in wells that contain members of the molecule library with 

3 standardized controls, small molecules that disrupt the interaction between the 

4 perturbagen and its target are identified. 

5 Another preferred method for identifying small molecule mimetics makes use of a 

6 variation of the two-hybrid technology. As one non-limiting example of how a two- 

7 hybrid chemical screen is performed, the yeast host cells containing i) AD-perturbagen, 

8 ii) the BD-target, and iii) a reporter construct made up of a promoter recognized by the 

9 BD, functionally linked to, for instance, the gene encoding lacZ or ZsGreen (Clontech), 

10 are grown in liquid culture media and subjected to the test chemical. Assay plates are 

1 1 then incubated at 30°C for 48 hours and samples are scored by looking the expression of 

12 the marker by FACS or other conventional techniques. As an alternative, compounds 

13 that are attached to a solid support (e.g. beads) can be tested for their ability to rescue the 

14 growth phenotype in solution -based assays. Specifically, yeast cells modified for reverse 

15 genetic studies can be arrayed in nanodroplets (100-200 nanoliter volumes) that contain i) 

16 the selective elements of the medium (e.g. 5-FOA, cyclohexamide) and ii) one or more 

17 beads linked to a chemical library member. Subsequent photolysis of the chemical agent 

18 from the bead allows diffusion of the test molecules into the yeast cell and disruption of 

19 the two-hybrid interaction (see, Borchardt A et al. (1997) "Small molecule-dependent 

20 genetic selection in stochastic nanodroplets as a means of detecting protein-ligand 

21 interactions on a large scale." Chemical Biology 4(12):961-8; You, A J. et al. (1997) "A 

22 miniaturized arrayed assay format for detecting small molecule-protein interactions in 

23 cells." Chem Biol 4(12):969-75; Huang, J. Schreiber, S.L. (1997) "A yeast genetic 

24 system for selecting small molecule inhibitors of protein-protein interactions in 

25 nanodroplets." PNAS 94:13396-13401; Young, K. et al. (1998) : Identification of a 

26 calcium channel modulator using a high throughgput yeast two-hybrid screen." Nature 

27 Biotechnology 16:946-950). 

28 L. Therapeutic Uses 

29 Natural and synthetic chemotherapeutic derivatives have proven valuable in the 

30 treatment of a variety of forms of disease. For that reason, in one embodiment, 

31 perturbagens, fragments or derivatives of a perturbagen, small molecule mimetics of a 
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1 perturbagen, sequences encoding perturbagens, sequences that can hybridize to 

2 perturbagen encoding sequences, targets of the perturbagen, or agents that bind said 

3 target (e.g. antibodies) or portions thereof, may be utilized to treat or prevent a disorder 

4 that has previously shown sensitivity to treatment with chemotherapeutics and/or 

5 radiation therapy. Thus, for example, polypeptides or RNA molecules described herein 

6 can be used i) modulate cellular proliferation, ii) modulate cellular differentiation, iii) 

7 induce or modulate necrotic or apototic processes, or iv) sensitize cells to secondary 

8 compounds that induce either i), ii), or iii) by direct application of said agent. Examples 

9 of such disorders that may be aided by such agents include, but are not limited to cancers 

10 of the I) ovary, ii) liver, iii) endometrium, iv) colon and/or rectum, v) prostrate, vi) 

1 1 uterus, vii) esophagus, viii) kidney, ix) thyroid, x) stomach, xi) brain, and xii) skin (e.g. 

12 melanoma). In addition, agents identified in previously described screens may be 

13 applicable in treating a variety of other diseases that directly or indirectly involve 

14 components of the P-catenin /TCF/APC pathway. Thus, any of the agents of the 

15 invention may be administered to a subject to treat or prevent, for instance, Alzheimer's 

16 disease, CHRPE and related afflictions. 

17 Ailments such as those described previously can be treated with the perturbagen 

18 or target directly, for example by administering a therapeutically effective dose of a 

19 proteinaceous agent intravenously or by other peptide delivery techniques known to the 

20 art. A therapeutically effective dose of a pharmaceutical composition comprising a 

21 substantially purified perturbagen, or a fragment thereof, or a small molecule mimetic, 

22 optionally in conjunction with a suitable pharmaceutical carrier, may be administered to a 

23 subject to treat or prevent a disorder previously shown to be related to the Pcatenin/Tcf 

24 pathway. A "therapeutically effective" dose refers to that amount of the compound 

25 sufficient to result in amelioration of symptoms of the disease. A "pharmaceutical 

26 carrier" includes any and all solvents, dispersion media, coatings, antibacterial and 

27 antifungal agents, and the like, compatible with pharmaceutical administration. The use 

28 of such media and agents for pharmaceutically active substances is well known in the art. 

29 Except insofar as any conventional media or agent is incompatible with the active 

30 compound, use thereof in the compositions is contemplated. 
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1 Toxicity and therapeutic efficacy of such compounds can be determined by 

2 standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 

3 determining the LD50 (the dose lethal to 50% of the population) and the ED 50 (the dose 

4 therapeutically effective in 50% of the population). The dose ratio between toxic and 

5 therapeutic effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. 

6 Compounds that exhibit large therapeutic indices are preferred. While compounds that 

7 exhibit toxic side effects may be used, care should be taken to design a delivery system 

8 that targets such compounds to the site of affected tissue in order to minimize potential 

9 damage to uninfected cells and, thereby, reduce side effects. 

10 The data obtained from the cell culture assays and animal studies can be used in 

1 1 formulating a range of dosage for use in humans. The dosage of such compounds lies 

12 preferably within a range of circulating concentrations that include the ED 50 with little or 

13 no toxicity. The dosage may vary within this range depending upon the dosage form 

14 employed and the route of administration utilized. For any compound used in the method 

15 of the invention, the therapeutically effective dose can be estimated initially from cell 

16 culture assays, A dose may be formulated in animal models to achieve a circulating 

17 plasma concentration range that includes the IC50 the concentration of the test 

18 compound that achieves a half-maximal inhibition of symptoms) as determined in cell 

19 culture. Such information can be used to more accurately determine useful doses in 

20 humans. Levels in plasma may be measured, for example, by high performance liquid 

2 1 chromatography. 

22 Pharmaceutical compositions of the invention are formulated to be compatible 

23 with intended routes of delivery. Examples of routes of administration include parenteral 

24 e.g. intravenous, intradermal, subcutaneous, oral, inhalation, transdermal, topical, 

25 transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 

26 intradermal, or subcutaneous application can include the following components: a sterile 

27 diluent, such as water for injection, saline solution, fixed oils, polyethylene, glycols, 

28 glycerine, propylene glycol, or other synthetic solvents, antibacterial agents such as 

29 benzyl alcohol or methyl parabens, antioxidants such as ascorbic acid or sodium bisulfite, 

30 chelating agents such as ethylenediaminetetraacetic acid, buffers such as acetates, 
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1 citrates, or phosphates and agents for the adjustment of tonicity such as sodium chloride 

2 or dextrose. 

3 Pharmaceutical compositions suitable for injectable use include aqueous solutions 

4 (where water-soluble) or dispersions and sterile powders for the extemporaneous 

5 preparation of sterile injectable solutions or dispersions. For intravenous administration, 

6 suitable carriers include physiological saline, bacteriostatic water Cremophor EL™ 

7 (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In all cases the composition 

8 must be sterile and should be fluid to the extent that easy syringability exists. Oral 

9 compositions can also be prepared using any of the following ingredients, or compounds 

10 of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth, or 

11 gelatin; an excipient such as starch or lactose, disintegrating agent such as alginic acid, 

12 Primogel, or corn starch; a lubricant such as magnesium stearate, a glidant such as 

13 colloidal silicon dioxide, a sweetening agent such as sucrose or saccharin, or a flavoring 

14 agent such as peppermint or orange flavoring. For administration by inhalation, the 

15 compounds are delivered in the form of an aerosol spray from a pressurized container or 

16 dispenser that contains a suitable propellant Systemic administration can also be by 

17 transmucosal or transdermal means. For these methods of administration, penetrants 

18 appropriate to the barrier to be permeated are used in the formulation. Such penetrants 

19 are generally known in the art and include, for example, bile salts and fusidic acid 

20 derivatives. Transmucosal administration can also be accomplished through the use of 

21 nasal sprays and suppositories. For transdermal administration, the active compounds are 

22 formulated into ointments, salves, gels, or creams as generally known in the art. 

23 In one embodiment, the active compounds are prepared with carriers that will 

24 protect the compound against rapid elimination from the body, such as a controlled 

25 microencapsulated delivery system. Biodegradable, biocompatible polymers can be used, 

26 such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, 

27 polyorthoesters, and polylactic acid. Methods for preparation of such formulations will be 

28 apparent to those skilled in the art. The materials can also be obtained commercially 

29 from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions 

30 (including liposomes targeted to infected cells with monoclonal antibodies to specific cell 

31 surface epitopes) can also be used as pharmaceutically acceptable carriers. These can be 
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1 prepared according to methods known to those skilled in the art, for example, as 

2 described in U.S. Patent No. 4,522,811. 

3 Alternatively, such therapeutics can be administered indirectly, for example by gene 

4 therapy utilizing a gene or RNA sequence encoding a perturbagen, pathway target, or 

5 variant or fragment of the foregoing. For example, a vector capable of expressing a 

6 perturbagen or target, or a fragment or derivative thereof, may be administered to a 

7 subject to treat or prevent a disease. Expression vectors including, but not limited to, 

8 those derived from retroviruses, adenoviruses, adeno-associated viruses, or herpes or 

9 vaccinia viruses or from various bacterial plasmids, may be used for delivery of 

10 nucleotide sequences to the targeted organ, tissue, or cell population (see, for example, 

11 Carter, P.J. and Samulski, RJ. (2000) "Adeno-associated viral vectors as gene delivery 

12 vehicles " Int J Mol Med. 6(1): 17-27; Palu, G. et al. (2000) "Progress with retroviral gene 

13 vectors." Rev Med Virol 10(3): 185-202; Wu, N. and Ataai, M.M. (2000) "Production of 

14 viral vectors for gene therapy applications." Curr Opin Biotechnol. 1 1(2):205-8). Gene 

15 therapy vectors can be delivered to a subject by, for example, intravenous injection, local 

16 administration (U.S. Patent 5,328,470) or by stereotactic injection (see, for example, 

17 Chen, S.H. et al. (1994) "Gene therapy for brain tumors: regression of experimental 

18 gliomas by adenovirus-mediated gene transfer in vivo " PNAS 91:3054-3057). The 

19 pharmaceutical preparation of the gene therapy vector can include the gene therapy 

20 vector in an acceptable diluent, or can comprise a slow release matrix in which the gene 

21 delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector can 

22 be produced intact from recombinant cells, e.g. retroviral vectors, the pharmaceutical 

23 preparation can include one or more cells which produce the gene delivery system. . 

24 M. Antisense, Ribozyme and Antibody Therapeutics 

25 Other agents that may be used as therapeutics include any pathway target genes, 

26 associated expression product and functional fragments thereof. Additionally, agents that 

27 reduce or inhibit mutant pathway target gene activity may be used to ameliorate disease 

28 symptoms. Such agents include antisense, ribozyme, and triple helix molecules. 

29 Techniques for the production and use of such molecules are well known to those of skill 

30 in the art. 
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1 Anti-sense RNA and DNA molecules act to directly block the translation of 

2 mRNA by hybridizing to targeted mRNA and preventing protein translation. With 

3 respect to antisense DNA, oligodeoxyribonucleotides derived from the translation 

4 initiation site, e.g., between the -10 and +10 regions of the P-catenin pathway target gene 

5 nucleotide sequence of interest, are preferred. 

6 Ribozymes are enzymatic RNA molecules capable of catalyzing cleavage of 

7 specific RNAs. The mechanism of ribozyme action involves sequence-specific 

8 hybridization of the ribozyme molecule to complementary target RNA, followed by an 

9 endonucleolytic cleavage. The composition of ribozyme molecules must include one or 

10 more sequences complementary to a P-catenin pathway target gene mRNA, and must 

1 1 include the well known catalytic sequence responsible for mRNA cleavage. For this 

12 sequence, see U.S. Patent No. 5,093,246, which is incorporated by reference herein in its 

13 entirety. As such within the scope of the invention are engineered hammerhead motif 

14 ribozyme molecules that specifically and efficiently catalyze endonucleolytic cleavage of 

15 RNA sequences encoding anaphylatoxin C3a receptor gene proteins. 

16 Specific ribozyme cleavage sites within any potential RNA target are initially 

17 identified by scanning the molecule of interest for ribozyme cleavage sites that include 

18 the following sequences, GUA, GUU and GUC. Once identified, short RNA sequences of 

19 between 15 and 20 ribonucleotides corresponding to the region of the p-catenin pathway 

20 target gene containing the cleavage site may be evaluated for predicted structural 

21 features, such as secondary structure, that may render the oligonucleotide sequence 

22 unsuitable. The suitability of candidate sequences may also be evaluated by testing their 

23 accessibility to hybridization with complementary oligonucleotides, using ribonuclease 

24 protection assays. 

25 Nucleic acid molecules to be used in triple helix formation for the inhibition of 

26 transcription should be single stranded and composed of deoxyribonucleotides. The base 

27 composition of these oligonucleotides must be designed to promote triple helix formation 

28 via Hoogsteen base pairing rules, which generally require sizeable stretches of either 

29 purines or pyrimidines to be present on one strand of a duplex. Nucleotide sequences may 

30 be pyrimidine-based, which will result in TAT and CGC triplets across the three 

31 associated strands of the resulting triple helix. The pyrimidine-rich molecules provide 
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1 base complementarity to a purine-rich region of a single strand of the duplex in a parallel 

2 orientation to that strand. In addition, nucleic acid molecules may be chosen that are 

3 purine-rich, for example, containing a stretch of G residues. These molecules will form a 

4 triple helix with a DNA duplex that is rich in GC pairs, in which the majority of the 

5 purine residues are located on a single strand of the targeted duplex, resulting in GGC 

6 triplets across the three strands in the triplex. 

7 Alternatively, the potential sequences that can be targeted for triple helix 

8 formation may be increased by creating a so called "switchback" nucleic acid molecule. 

9 Switchback molecules are synthesized in an alternating 5-3', 3'-5' manner, such that they 

10 base pair with first one strand of a duplex and then the other, eliminating the necessity for 

11 a sizeable stretch of either purines or pyrimidines to be present on one strand of a duplex. 

12 It is possible that the antisense, ribozyme, and/or triple helix molecules described herein 

13 may reduce or inhibit the transcription (triple helix) and/or translation (antisense, 

14 ribozyme) of mRNA produced by both normal and mutant p-catenin pathway target gene 

15 alleles. In order to ensure that substantially normal levels of p-catenin pathway target 

16 gene activity are maintained, nucleic acid molecules that encode and express P-catenin 

17 pathway target gene polypeptides exhibiting normal activity may be introduced into cells 

18 that do not contain sequences susceptible to whatever antisense, ribozyme, or triple helix 

19 treatments are being utilized. Alternatively, it may be preferable to co-administer normal 

20 P-catenin pathway target gene protein into the cell or tissue in order to maintain the 

21 requisite level of cellular or tissue P-catenin pathway target gene activity. 

22 Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention 

23 may be prepared by any method known in the art for the synthesis of DNA and RNA 

24 molecules. These include techniques for chemically synthesizing 

25 oligodeoxyribonucleotides and oligoribonucleotides well known in the art such as for 

26 example solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules 

27 may be generated by in vitro and in vivo transcription of DNA sequences encoding the 

28 antisense RNA molecule. Such DNA sequences may be incorporated into a wide variety 

29 of vectors that incorporate suitable RNA polymerase promoters such as the T7 or SP6 

30 polymerase promoters. Alternatively, antisense cDNA constructs that synthesize 
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1 antisense RNA constitutively or inducibly, depending on the promoter used, can be 

2 introduced stably into cell lines. 

3 Various well-known modifications to the DNA molecules may be introduced as a 

4 means of increasing intracellular stability and half-life. Possible modifications include 

5 but are not limited to the addition of flanking sequences of ribonucleotides or 

6 deoxyribonucleo tides to the 5 f and/or 3' ends of the molecule or the use of 

7 phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within the 

8 oligodeoxyribonucleotide backbone. 

9 Antibodies that are both specific for P-catenin pathway target gene protein, and in 

10 particular, mutant gene protein, and interfere with its activity may be used to inhibit 

1 1 mutant P-catenin pathway target gene function. Such antibodies may be generated 

12 against the proteins themselves or against peptides corresponding to portions of the 

13 proteins using standard techniques known in the art and as also described herein. Such 

14 antibodies include but are not limited to polyclonal, monoclonal, Fab fragments, single 

15 chain antibodies, chimeric antibodies, etc. 

16 In instances where a p-catenin pathway target gene protein is intracellular and 

17 whole antibodies are used, internalizing antibodies may be preferred. However, 

18 lipofectin liposomes may be used to deliver the antibody or a fragment of the Fab region 

19 that binds to the p-catenin pathway target gene epitope into cells. Where fragments of the 

20 antibody are used, the smallest inhibitory fragment that binds to the target or expanded 

21 target protein's binding domain is preferred. For example, peptides having an amino acid 

22 sequence corresponding to the domain of the variable region of the antibody that binds to 

23 the p-catenin pathway target gene protein may be used. Such peptides may be 

24 synthesized chemically or produced via recombinant DNA technology using methods 

25 well known in the art (see, e.g., Creighton, Proteins: Structures and Molecular Principles 

26 (1984) W.H. Freeman, New York 1983, supra; and Sambrook, et al, 1989, supra). 

27 Alternatively, single chain neutralizing antibodies that bind to intracellular P-catenin 

28 pathway target gene epitopes may also be administered. Such single chain antibodies 

29 may be administered, for example, by expressing nucleotide sequences encoding single- 

30 chain antibodies within the target cell population by utilizing, for example, techniques 
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1 such as those described in Marasco, et al, Proc. Natl Acad, ScL USA, 90:7889-93 

2 (1993). 

3 N. Diagnostic Uses 

4 The polynucleotides, polypeptides, variants, targets and antibodies to any one of 

5 these molecules can, in addition to previously mentioned therapeutic applications, be 

6 used in one or more of the following methods: 1) detection assays (e.g. chromosomal 

7 mapping, tissue typing, forensic biology), and 2) predictive medicine (e.g, diagnostic or 

8 prognostic assays, pharmacogenomics and monitoring clinical trials). Thus, for example, 

9 agents may be used to detect a specific mRNA or gene (e.g. in a biological sample) for a 

10 genetic lesion. Similarly, agents described herein may be applied to the field of 

1 1 predictive medicine in which diagnostic assays or prognostic assays, pharmacogenomics, 

12 and monitoring clinical trials are used for predictive purposes to thereby treat an 

13 individual prophylactically. 

14 Accordingly, one aspect of the present invention relates to diagnostic assays for 

15 determining expression of a polypeptide or nucleic acid of the invention and/or activity of 

16 said agent of the invention, in the context of a biological sample to thereby determine 

17 whether an individual is afflicted with a disease or disorder, or is at risk of developing a 

18 disorder, associated with aberrant expression or activity of a polypeptide or 

19 polynucleotide of the invention. 

20 Alternatively, the invention provides methods for detecting expression of a 

21 nucleic acid or polypeptide of the invention or activity of a polypeptide or polynucleotide 

22 of the invention in an individual to thereby select appropriate therapeutic or prophylactic 

23 agents for that individual (referred to herein as "pharmacogenomics"). 

24 Pharmoacogenomics allows for the selection of agents (e.g. drugs) for therapeutic or 

25 prophylactic treatment of an individual based on the genotype of the individual (e.g, the 

26 genotype of the individual examined to determine the ability of the individual to respond 

27 to a particular agent). Still another aspect of the invention pertains to monitoring the 

28 influence of agents (e.g. drugs or other compounds) on the expression or activity of a 

29 polypeptide or polynucleotide of the invention in clinical trials. 
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1 1. Detection Assays 

2 Portions or fragments of the polynucleotide sequences of the invention can be 

3 used in numerous ways as polynucleotide reagents. For example, these sequences can be 

4 used to i) map their respective genes on a chromosome and, thus, locate gene regions 

5 associated with genetic diseases; ii) identify an individual from a minute biological 

6 sample (tissue typing); and iii) aid in forensic identification of biological samples. 

7 a. Gene and Chromosome Mapping. 

8 Once the sequence (or portion of a sequence) of a gene has been isolated, this 

9 sequence can be used to identify the entire gene, analyze the gene for homology to other 

10 sequences (i.e., identify it as a member of a gene family such as EGF receptor family) 

1 1 and then map the location of the gene on a chromosome. Accordingly, nucleic acid 

12 molecules described herein or fragments thereof, can be used to map the location of the 

13 gene on a chromosome. The mapping of the sequences to chromosomes is an important 

14 first step in correlating these sequences with genes associated with disease. 

15 Briefly, genes can be mapped to chromosomes by preparing PCR primers from the 

16 sequence of a gene of the invention. These primers can then be used for PCR screening 

17 of somatic cell hybrids containing individual chromosomes. Only those hybrids 

18 containing the human gene corresponding to the gene sequences will yield an amplified 

19 fragment (For review of this technique se D 1 Eustachio, P. and Ruddle, F.H. (1983) 

20 "Somatic cell genetics and gene families." Science 220:919-924). Alternative methods of 

21 mapping a gene to its chromosome include in situ hybridization (see, for example, Fan, 

22 Y.S. et al. (1990) "Mapping small DNA sequences by fluorescence in situ hybridization 

23 directly on banded metaphase chromosomes." PNAS 87:6223-27), pre-screening with 

24 labeled flow sorted chromosomes (CITE), and pre-selection by hybridization to 

25 chromosome specific cDNA libraries. Furthermore, fluorescence in situ hybridization 

26 (FISH) of a DNA sequence to a metaphase chromosome spread can further be used to 

27 provide a precise chromosomal location in one step (see "Human Chromosomes: A 

28 Manual of Basic Techniques", Pergamon Press, New York, 1988). Lastly, with the 

29 completion (in the not-to-distant future) of the sequencing of the human genome, 

30 chromosome mapping will very quickly switch from elaborate, hands-on methods of 

31 mapping genes, to simple database searches 
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1 Once the sequence (or portion of a sequence) of a gene has been isolated, these 

2 agents can be used to assess the intactness or functionality of a particular gene. 

3 Comparison of affected and unaffected individuals can begin with looking for structural 

4 alterations in the chromosomes such as deletions, inversions, or translocations that are 

5 based on that DNA sequence. Once this is accomplished, the physical position of the 

6 sequence on the chromosome can be correlated with genetic data map. (such data are 

7 found, for example in McKusick, V. "Mendialian Inheritance in Man" available on-line 

8 through John Hopkins University Welch Medical Library). The relationship between 

9 genes and disease, mapped to the same chromosomal region can then be identified 

10 through linkage analysis (co-inheritance of physically adjacent genes), described in e.g. 

1 1 Egeland, J. A. et al (1987) "Bipolar affective disorders linked to DNA markers on 

12 chromosome 11." Nature, 325:783-787). Alternatively, polynucleotide sequences can be 

13 used as probes in Southern Blot analysis to identify alterations in the organization of the 

14 gene of interest and surrounding regions. Ultimately, complete sequencing of genes from 

15 several individuals can be performed to confirm the presence of a mutation and to 

16 distinguish mutations from polymorphisms. If a specific mutation is observed in some or 

17 all individuals affected by a particular disease, but not in any unaffected individuals, then 

18 the mutation is likely to be the causative agent of the particular disease. 

19 b. Tissue Typing 

20 The nucleic acid sequences of the present invention can also be used to identify 

21 individuals from minute biological samples. The United States military, for example, is 

22 considering the use of restriction fragment length polymorphism (RFLP) for 

23 identification of its personnel. In this technique, an individual's genomic DNA is 

24 digested with one or more restriction enzymes, and probed on a Southern blot to yield 

25 unique bands for identification. The sequences of the present invention are useful as 

26 additional DNA markers for RFLP mapping (described in US Patent 5,272,057). 

27 Furthermore the sequences of the present invention can be used to determine the actual 

28 base-by-base DNA sequence of selected portions of an individual's genome. Thus, the 

29 nucleic acid sequences described herein can be used to prepare two PCR primers from the 

30 5' and 3' ends of the individual's DNA and subsequently sequence it. Panels of 

31 corresponding DNA sequences from individuals, prepared in this manner, can provide 
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1 unique individual identifications, as each individual will have a unique set of such DNA 

2 sequences due to allelic variation. The sequences of the present invention can be used to 

3 obtain such identification sequences from individuals and from tissue. The nucleic acid 

4 sequences of the invention uniquely represent portions of the human genome. Allelic 

5 variation occurs to some degree in the coding regions of these sequences, and to a greater 

6 degree in the non-coding regions. It is estimated that allelic variation between individual 

7 humans occurs with a frequency of about once per 500 bases. Thus, each of the 

8 sequences described herein may be, to some degree, used as a standard against which 

9 DNA from an individual can be compared for identification purposes. 

10 c. Forensic Biology 

1 1 In addition the sequences described herein can be used in forensic biology. 

12 Forensic biology is a scientific field employing genetic typing of biological evidence 

13 found at a crime scene as a means for positively identifying, for example a perpetrator of 

14 a crime. To make such an identification, PCR-based technology can be used to amplify 

15 DNA sequences taken from very small biological samples such as tissues, (e.g. hair, skin, 

16 or body fluids). The amplified sequence can then be compared to a standard thereby 

17 allowing identification of the origin of the biological sample. 

1 8 The sequences of the present invention can be used to provide polynucleotide 

19 reagents (e.g. PCR primers) targeted to specific loci in the human genome, which can 

20 enhance the reliability of DNA-based forensic identifications by, for example, providing 

21 another "identification marker" (i.e. another DNA sequence that is unique to a particular 

22 individual. The nucleic acid sequences described herein can further be used to provide 

23 polynucleotide reagents e.g. labeled or labelable probes, which can be used in, for 

24 example, an in situ hybridization technique, to identify a specific tissue. This technique 

25 can be exceedingly useful in cases where a forensic pathologist is presented with a tissue 

26 of unknown origin. Panels of such probes can be used to identify tissue by species and/or 

27 organ type. 

28 O. Predictive Medicine 

29 Portions or fragments of the polynucleotide sequences of the invention can be 

30 used for predictive purposes to thereby treat an individual prophylactically. 
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1 unique individual identifications, as each individual will have a unique set of such DNA 

2 sequences due to allelic variation. The sequences of the present invention can be used to 

3 obtain such identification sequences from individuals and from tissue. The nucleic acid 

4 sequences of the invention uniquely represent portions of the human genome. Allelic 

5 variation occurs to some degree in the coding regions of these sequences, and to a greater 

6 degree in the non-coding regions. It is estimated that allelic variation between individual 

7 humans occurs with a frequency of about once per 500 bases. Thus, each of the 

8 sequences described herein may be, to some degree, used as a standard against which 

9 DNA from an individual can be compared for identification purposes. 

10 c. Forensic Biology 

11 In addition the sequences described herein can be used in forensic biology. 

12 Forensic biology is a scientific field employing genetic typing of biological evidence 

13 found at a crime scene as a means for positively identifying, for example a perpetrator of 

14 a crime. To make such an identification, PCR -based technology can be used to amplify 

15 DNA sequences taken from very small biological samples such as tissues, (e.g. hair, skin, 

16 or body fluids). The amplified sequence can then be compared to a standard thereby 

17 allowing identification of the origin of the biological sample. 

18 The sequences of the present invention can be used to provide polynucleotide 

19 reagents (e.g. PCR primers) targeted to specific loci in the human genome, which can 

20 enhance the reliability of DNA-based forensic identifications by, for example, providing 

21 another "identification marker" (i.e. another DNA sequence that is unique to a particular 

22 individual. The nucleic acid sequences described herein can further be used to provide 

23 polynucleotide reagents e.g. labeled or labelable probes, which can be used in, for 

24 example, an in situ hybridization technique, to identify a specific tissue. This technique 

25 can be exceedingly useful in cases where a forensic pathologist is presented with a tissue 

26 of unknown origin. Panels of such probes can be used to identify tissue by species and/or 

27 organ type. 

28 O. Predictive Medicine 

29 Portions or fragments of the polynucleotide sequences of the invention can be 

30 used for predictive purposes to thereby treat an individual prophylactically. 
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1 1. Diagnostic /Prognostic Assays 

2 One method of detecting the presence or absence of a polypeptide or nucleic acid 

3 in a biological sample is to expose that sample to an agent that recognizes the entity in 

4 question. A preferred agent for detecting mRNA or genomic DNA is a labeled nucleic 

5 acid probe capable of hybridizing to the sequence one is attempting to detect (for 

6 instance, the sequence of the invention). The nucleic acid probe can be, for example, a 

7 full length cDNA, or a portion thereof such as an oligonucleotide of at least 15, 30, 50, 

8 100, 250, or 500 nucleotides in length and sufficient to specifically hybridize under 

9 stringent conditions to a mRNA or genomic DNA encoding the invention. The term 

10 "labeled" in this context refers to modifications in said sequences including, but not 

1 1 limited to, biotin labeling that can then be detected with a fluorescently labeled 

12 streptavidin, or 32 P labeling. 

13 A preferred agent for detecting a polypeptide of the invention is an antibody or 

14 peptide capable of binding to the invention, preferably an antibody with a detectable 

15 label. Antibodies can be polyclonal or more preferably, monoclonal. An intact antibody, 

16 or a fragment thereof (e.g. a Fab or F(ab)2) can be used. The term "labeled" in this 

17 context refers to direct labeling of the probe or antibody by coupling (i.e. physical 

18 linking) a detectable substance to the probe or antibody, such as a fluorescent labeled 

19 moiety or biotin. 

20 The detection methods of the invention can be used to detect mRNA, protein, or 

21 genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 

22 techniques for detection of mRNA include (but are not limited to) Northern Blot 

23 hybridization and in situ hybridizations. In vitro techniques for detection of a 

24 polypeptide of the invention include enzyme linked immunosorbent assays (ELISA's), 

25 Western blots, immunoprecipitations, and immunofluorescence. 

26 The invention also encompasses kits for detecting the presence of a polypeptide or 

27 nucleic acid of the invention in a biological sample. Such kits can be used to determine if 

28 a subject is suffering from or is at increased risk of developing a disorder associate with 

29 aberrant expression of a polypeptide or polynucleotide of the invention. For instance, the 

30 kit can comprise a labeled compound or agent (as well as all the necessary supplementary 

31 agents needed for signal detection e.g. buffers, substrates, etc. . .) capable of detecting the 
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1 polypeptide, or mRNA in the sample (e.g. an antibody which binds the polypeptide or a 

2 oligonucleotide probe that binds to DNA or mRNA encoding the polypeptide). 

3 The methods of the invention can also be used to detect genetic lesions or 

4 mutations in a gene of the invention, thereby determining if a subject with the lesioned 

5 gene is at risk for a disorder characterized by aberrant expression or activity of an agent 

6 of the invention. In preferred embodiments, the methods include detecting the presence 

7 or absence of a genetic lesion or mutation characterized by at least one alteration 

8 affecting the integrity of the agent of the invention. For example, such genetic lesions or 

9 mutations can be detected by ascertaining the existence of at least one of: 1) a deletion of 

10 one or more nucleotides from a gene; 2) an addition of one or more nucleotides to a gene; 

11 3) a substitution of one or more nucleotides of the gene; 4) a chromosomal rearrangement 

12 of the gene; 5) an alteration in the level of a messenger RNA transcript of the gene; 6) an 

13 aberrant modification of the gene, such as of the methylation pattern of the genomic 

14 DNA; 7) the presence of a non-wild type splicing pattern of a messenger RNA; 8) a non- 
15 wild type level of the protein encoded by the gene; 9) an allelic loss of the gene; and 10) 

16 an inappropriate post translational modification of the protein encoded by the gene. 

17 Many techniques can be used to detect lesions such as those described above. For 

18 instance, mutations in a selected gene from a sample can be identified by alterations in 

19 restriction enzyme cleavage patterns. In this procedure, sample and control DNA is 

20 isolated, digested with one or more restriction endonucleases, and fragment length sizes 

21 (determined by gel electrophoresis) are compared. Observable differences in fragment 

22 length sizes between sample and control DNA indicates mutations in the sample DNA. 

23 Additional techniques that can be applied to detecting mutations include, but are not 

24 limited to, detection based on direct sequencing, PCR-based detection of deletions, 

25 inversions, or translocations, detection based on mismatch cleavage reactions (Myers, 

26 R.M. et al. (1985) "Detection of single base substitutions by ribonuclease cleavage at 

27 mismatches in RNA:DNA duplexes." Science 230:1242), and detection based on altered 

28 electrophoretic mobility (e.g. SSCP, see, for example, Orita, M. et al. (1989) "Detection 

29 of polymorphisms of human DNA by gel electrophoresis as single-strand conformation 

30 polymorphisms." PNAS 86:2766). 
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1 2. Pharmacogenetics 

2 Pharmacogenetics deals with clinically significant hereditary variation in the 

3 response to drugs due to altered drug disposition and altered action in affected persons 

4 (see Iinder, M.W. et al. (1997) "Pharmacogenetics: a laboratory tool for optimizing 

5 therapeutic efficiency." Clin Chem. 43(2):254-266). In general, two types of 

6 pharmacogenetic conditions can be differentiated. There are genetic conditions 

7 transmitted as a single factor altering the way drugs act on the body, referred to as 

8 "altered drug action". Alternatively, there are genetic conditions transmitted as single 

9 factors altering the way the body acts on drugs (referred to as "altered drug metabolism"). 

10 These two conditions can occur either as rare defects, or as polymorphisms. For 

1 1 example, glucose-6-phosphate dehydrogenase deficiency is a common inherited 

12 enyzmopathy in which the main clinical complication is haemolysis after ingestion of 

13 oxidant drugs (e.g. anti-malarials, sulfonamides etc.). 

14 The activity of drug metabolizing enzymes is a major determinant of both the 

1 5 intensity and duration of drug action. The discovery of genetic polymorphisms of drug 

16 metabolizing enzymes (e.g. N-acetyltransferase 2 (NAT2) and cytochrome P450 enzymes 

17 (CYP2D6 and CYP2C 19) has provided an explanation as to why some patients do not 

1 8 obtain the expected drug effects or show exaggerated drug response and serious toxicity 

19 after taking the standard and safe dose of a drag. These polymorphisms are expressed in 

20 two phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer 

21 (PM). The prevalence of PM is different among different populations. For example, the 

22 gene coding for CYP2D6 is highly polymorphic and several mutations have been 

23 identified in PM which all lead to the absence of functional CYP2D6. Poor metabolizers 

24 of this sort quite frequently experience exaggerated drug response and side effects when 

25 they receive standard doses. If a metabolite is the active therapeutic moiety, a PM will 

26 show no therapeutic response, as demonstrated for the analgesic effect of codeine 

27 mediated by its CYP2D6-formed metabolite morphine. At the other extreme are the so- 

28 called ultra rapid metabolizer who do not respond to standard doses. Recently, the 

29 molecular basis of ultra rapid metabolism has been identified to be due to CYP2D6 gene 

30 amplification. 
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1 Thus the in the context of pharmacogenetics, an agent of the invention can be 

2 used to determine or select appropriate agents for therapeutic prophylactic treatment of 

3 the individual. In addition, pharmacogenetic studies can be used to apply genotyping of 

4 polymorphic alleles encoding drug-metabolizing enzymes to the identification of an 

5 individuals drug responsiveness phenotype. 

6 3. Monitoring of Effects During Clinical Trials 

7 Monitoring the influence of agents that effect the expression or activity of a 

8 polypeptide or polynucleotide of the invention can be applied in clinical trials. For 

9 example, the effectiveness of a drug directed toward a target identified by the invention 

10 and intended to treat a particular ailment, can be monitored in clinical trials of subjects 

1 1 exhibiting said ailment by monitoring the level of gene expression of the target, activity 

12 of the target, or levels of the target of the invention. Thus in a preferred embodiment, the 

1 3 present invention provides a method for monitoring the effectiveness of treatment of a 

14 subject with an agent by comprising the steps of (i) obtaining a pre-administration sample 

15 from a subject prior to administration of the agent; (ii) detecting the level of the 

16 polypeptide or polynucleotide of the invention in the pre-administration sample; (iii) 

17 obtaining one or more post-administration samples from the subject; (iv) detecting the 

1 8 level or activity of said target of the invention in the post-administration samples, (v) 

19 comparing the level of said target of the invention in the post administration sample with 

20 levels in the pre-administration samples, and (vi) altering the administration of the agent 

21 to the subject accordingly. 
22 



23 Examples 

24 The following examples are intended to further illustrate certain preferred 

25 embodiments of the invention, and are not limiting in nature. 
26 

27 Example 1: Cell lines 

28 

29 S4535/HEK293 cells were propagated as monolayers in DMEM media (Gibco 

30 BRL) supplemented with 10% FBS, L-Glutamine (2mM final), non-essential amino acids 

3 1 (IX), Sodium Pyruvate (ImM), 300 ng/ml puromycin (ICN Biomedicals; Costa Mesa, 
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1 CA) and 300 |wg/ml neomycin (Life Technologies; Gaithersburg, MD). The colorectal 

2 adenocarcinoma HT-29 cells and DID 1-1 cells were both obtained from ATCC and 

3 grown in McCoy's 5 A media (Gibco BRL) modified with 10% FBS (Hyclone; Logan, 

4 UT). The packaging cell line, gp293 (a gift from Dr. Inder Verma, Salk Institute, CA) 

5 was maintained in DMEM, 10% fetal calf serum, and 200 jig/ml blasticidin (ICN 

6 Biomedicals; Costa Mesa, CA). HUVECs (Human umbilical vein epithelial cells, 

7 Clonetics Walkersville, MD) were maintained in EBM-2 basal media supplemented with 

8 a Bulletkit (Clonetics). HMECs (Human mammary epithelial cells, Clonetics) were 

9 maintained in DFCI-1 media (see Band, V. et al (1989) "Distinctive traits of normal and 

10 tumor-derived human mammary epithelial cells expressed in a medium that supports 

1 1 long-term growth of both cell types." PNAS 86: 1249-53). S W620 cells were maintained 

12 in DMEM (Life Technologies) enriched with 10% fetal calf serum. Cultures were grown 

13 at 33°C or 37 °C (5% C0 2 ) in standard tissue culture flasks. In some instances, Pen/Strep 

14 (IX, lOOug/ml ea.) was added to the cultures to minimize the risk of bacterial 

15 contamination. 

16 Example 2: Construction of the TBE-2 Reporter 

17 The vector (pBV-LUC) carrying the P-catenin /Tcf responsive promoter element 

18 consisting of 4 tandem repeats of the TBE-2 cassette was provided by the B. Vogelstein, ( 

19 Johns Hopkins University). To construct a retroviral reporter vector, pB V-LUC was 

20 digested with Mlul/Bgin to remove the TBE-2 promoter and blunted using Klenow 

21 fragment. Subsequently, this fragment was ligated into the filled Clal/BamHI sites of 

22 pVT806. As a result of these procedures, the product (pVT806-TBE2-EGFP) contains a 

23 TBE-2 promoter that is operably linked to coding sequence of EGFP and can be 

24 introduced into HEK293 cells using standard retroviral techniques. 



25 

26 Example 3: Isolation and Construction of WT and 

27 Mutant P-catenin Expression Vectors 

28 p-catenin was derived from PCR amplification of cDNA prepared in-house. 



29 Specifically, total RNA prepared from HEK293 or HT29 cells was used to construct 

30 cDNA by methods common to the field (Superscript, LifeTechnologies). Subsequently 

3 1 the N-terminal and C-terminal halves of p-catenin were amplified separately using 
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1 catenin-specific primers containing flanking restriction sites. (N-terminal primers =OVT 

2 1801 5' CGCGGATCCGGCTACTCAAGCTGATTTGATGGAG and 1802 5'AGTC 

3 GTGGAATGGCACCCTGCTCAC C-terminal primers 1803 5'CTCCACAACC 

4 TTTTATTACATCAAG and 1 804 5 'TCCCCCGGGGCC AATC AC AATGCAAGT 

5 TCAGACA; PCR conditions, i) 5\ 94°C; ii) 40", 94°C; iii) 40", 57°C; iv) 2\ 72°C; v) 

6 10', 72°C, cycle through steps ii-iv 30 times, TAQ polymerase). The PCR products were 

7 subsequently purified (Qiagen PCR Kit), digested with the appropriate enzymes, and 

8 ligated into pCMV-TAG-3 A. After validating the sequence of each half of the gene, N- 

9 and C-terminal clones were spliced together to form a full-length cDNA (Figure 9). 

10 In order to construct an allele of p-catenin that was insensitive to regulation by 

1 1 APC, a synthetic allele carrying a substitution of tyrosine for serine at position 45 (S45Y) 

12 was constructed using QuikChange (Stratagene, see Kunkel, T.A. (1985) "Rapid and 

13 efficient site-specific mutagenesis without phenotypic selection." PNAS 82:488). 

14 Specifically, the full-length (3-catenin clone was combined with Pfu Turbo and two 

15 complementary oligonucleotides encoding the required nucleotide change (OVT 

16 1 831,1832). The sample was then subjected to temperature cycling to allow 

17 incorporation of the mutated primers followed by digestion with Dpnl (37°C for 1 hr.) to 

18 cut the parental DNA template. The resulting nicked, double-stranded molecules were 

19 then transformed into bacteria (DH5a cells) for repair of the newly mutated vector. The 

20 full-length cDNA of p-catenin S45Ywas subsequently cloned into the SacH/Clal sites of 

21 the retroviral vector pVT312 and prepared for packaging in 293gp cells. 
22 

23 Example 4: Isolation and Construction of WT and 

24 Mutant TCF-4 Expression Vectors 

25 Using techniques similar to those described above, Tcf-4 was PCR amplified from 

26 a HEK293 cDNA library. Specifically, the N- and C-terminal halves of Tcf-4 were 

27 constructed separately using TCF-4 specific primers flanked with either BamHI/PstI or 

28 Pstl/EcoRI restriction sites (OVT 1805 5' CGCGGATCCGATGCCGCAGCTGAA 

29 CGGCGGTGGAG, 1806 5' TCTACGTCTGCAGGTAAGTGTGGAGGTG GGTTTC, 

30 and 1807 5'CTTACCTGCAGACGTAGACCCCAAAACAGGA, 1808 5' 

3 1 CAGCGGAATTC ACGACGCTAAAGCTATTCTA). The N-terminal PCR product 
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1 yielded two products of approximately 605 bp in size. Sequence analysis subsequently 

2 identified the lower of the two bands as being the true N-terminus of Tcf . In contrast, 

3 PCR reactions designed to amplify the C-terminal region of Tcf yielded a minimum of 

4 three products that were later identified as splice forms Tcf-4B, Tcf-4DE, and Tcf-4DIE. 

5 The PCR products of both reactions were subsequently purified (Qiagen PCR Kit), 

6 digested with the appropriate enzymes, and ligated into pCMV-Tag-3A. Individual 

7 clones were then sequenced to confirm identity and then spliced together to form a full- 

8 length cDNA using standard molecular biological techniques (Figure 10). Upon 

9 confirmation of intactness of each construct, the cDNA' s were ligated into the S aciI/Clal 

10 sites of retroviral expression vector, pVT3 12. 

1 1 To create N-terminal deletions of Tcf that were capable of blocking the activity of 

12 p-catenin S45Y, PCR primers that initiated amplification at internal sites were used to 

13 isolate truncated Tcf cDNA's. Specifically, OVT 1826, which annealed 90 nucleotides 

14 from the N-terminus of Tcf was used in conjunction with the T7 primer to amplify a 

15 cDNA that lacked the N-terminal-most 30 amino acids. The resulting fragment was then 

16 cloned into pVT312 using techniques common to the art. In addition, the deletion 

17 mutants were also fused in-frame to the C-terminus of either the Gal4 BD or dGFP 

18 cDNA. To accomplish this, oligos specific to either Gal4 (OVT 1845, 1846) or dGFP 

19 (OVT 1848 and 1849) were used to PCR amplify the correct product from pFA-cJun and 

20 pVT352.1 respectively (Cycling conditions 5', 94°C; 40", 94°C; 40", 58°C; 1', 72°C; 

21 (cycle 16-20 times) 10', 72°C). Each of the fragments was then gel purified, digested 

22 with SacH and ligated (T4 ligase) into the equivalent site of pVT 312 carrying the Tcf- 

23 4DI deletion mutant. Restriction digest mapping was then used to identify which of the 

24 clones contained the scaffolding molecule in the correct orientation. . 

25 Example 5: Selection or a Reporter Cell Line 

26 As a first step in identifying a reporter cell line, each of the necessary vectors 

27 including i) pVT806 TBE2-EGFP and ii) pVT3 12- p-catenin S45Y were packaged in 

28 293gp cells using variations of one of two methods. In the first technique, the two 

29 constructs are co-transduced with VSV-G envelope expression plasmid into 293gp 

30 packaging cells (gift of I. Verma, Salk Institute) using LIPOFECT AMINE (life 

31 Technologies). To accomplish this, 3 x 10 6 cells of the packaging cell line (293gp) are 
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1 seeded into a T175 flask. On the following day, two tubes are prepared, one containing 

2 15 ug of pVT806 TBE2-EGFP DNA (or pVT312- p-catenin S45Y DNA) plus 10 ug of 

3 envelope plasmid (pCMV-VSV.G-bpa) in 1.5 ml DMEM (serum free), and the second 

4 containing 100 ug of LEPOFECT AMINE in 1 .5 ml DMEM (serum free). These tubes are 

5 incubated separately at room temperature for 30 minutes, mixed, and incubated for an 

6 additional 30 minutes. This cocktail is referred to as the "transfection mix." The 293gp 

7 cells are then gently washed with serum free media and exposed to 20 ml of the 

8 transfection mix for 4 hours at 37 °C. The overlying mixture is then removed, the cells 

9 are washed once in DMEM (containing 10% serum) and then cultured in the same media. 

10 After 72 hours at 37°C the media (now referred to as "viral supernatant") is collected, 

1 1 filtered through a 0.45 ^im filter and frozen at -80°C. When needed, it is possible to make 

12 a second collection of virus by adding 15mls of DMEM (10% serum) back to the cells 

13 and incubating a further 24 hours. 

14 As an alternative methodology, retroviral DNA can be packaged using a 

15 technique that is referred to herein as the "CaCl 2 Method". In one variation of this 

16 method, 5 x 10 6 cells of the packaging cell line (293gp) are seeded into a 15cm 2 flask on 

17 Day 1. On the following day, the media is replaced with 22.5 mis of modified DMEM. 

1 8 Subsequently, a single tube carrying 22.5)ig of either pVT806 TBE2-EGFP (or pVT3 12- 

19 p-catenin S45Y) and 22.5 jag of envelope expression plasmid (pCMV-VSV.G-bpa) is 

20 brought to 400|il with dH 2 0, and 100(xl of CaCl 2 (2.5M) plus 500|il of BBS (drop- wise 

21 addition, 2x solution = 50mM, BES (N,N-bis(2-hydroxyethyl)-2-aminoethane-suIfonic 

22 acid), 280mM NaCl, L5mM Na 2 HP0 4 , pH 6.95) is added. After allowing this retroviral 

23 mixture to sit at room temperature for 5-10 minutes, it is introduced to the 293gp cells in 

24 a drop-wise fashion, and the cells are then incubated at 37°C (3% C0 2 ) for 16-24 hours. 

25 The media is then replaced and the cells are allowed to incubate for an additional 48-72 

26 hours at 37°C. At that time, the media containing the viral particles is then collected, 

27 filtered through a 0.45|i filter and frozen down at -80°C. Retroviral supernatant can 

28 subsequently be thawed and used directly to infect the appropriate host cells. 

29 Viral supernatants of both pVT806 TBE2-EGFP and pVT312- p-catenin S45Y were then 

30 used to transduce HEK293 cells (ATCC # CRL-1573). Following standard procedures 

31 common to the art, a population of roughly 1 X 10 7 HEK293 cells were transduced 
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1 sequentially with each of the retroviral supernatants for 24 hours, using a 20% vol/vol of 

2 retroviral supernatant to complete media (KBM catalogue no.CC3101, Clonetics) plus 

3 2% FBS. The cells were then allowed to recover for 24 hours in complete media 

4 followed by incubation in media containing the appropriate selectable marker (puromycin 

5 (0.3 \ig/ml) and neomycin (600 ug/ml) to select cells containing stable inserts of the two 

6 constructs. 

7 Fluorescent activated cell sorting (FACS) was subsequently used to identify 

8 individual clones capable of responding to the activation of the (3-catenin-Tcf pathway 

9 (Coulter EPICS Elite Cell Sorter using EXPO "Build" and EXPO "Analysis" software). 

10 In the initial starting population (F0), a bi-modal histogram containing two overlapping 

11 populations, one dim and one weakly fluorescent, were observed. (Figure 11). Using 

12 standard sorting procedures, cells exhibiting a weak-bright fluorescent profile were 

13 collected (referred to herein as Fl). When this population was expanded and reexamined 

14 by FACS, the proportion of bright and dim cells was observed to be more heavily skewed 

15 toward a bright peak. The brightest 5.6% of this population was then collected and 

16 expanded (referred to herein as F2). Subsequent FACS analysis of the expanded F2 

17 population showed that the bimodal nature of the histogram was nearly absent and that 

18 the vast majority of the cells were moderately to highly fluorescent. The top 1 1.9% of 

19 this population (referred to herein as F3) was then sorted, and plated in 96 well plates at 

20 low density to isolate individual clones with highly fluorescent properties. Forty-five 

21 clones isolated from these procedures were then expanded into 6 cm plates for further 

22 analysis. Upon FACS analysis, 6 of these clones were observed to show sufficient levels 

23 of fluorescence to warrant further investigation into the possibility of their use as reporter 

24 constructs. 

25 To identify a reporter clone that could be modulated by the action of 

26 perturbagens, the retroviral construct, pVT312-dGFP-ATcf-30, carrying the dominant 

27 negative form of Tcf 4DIE fused to dGFP, was introduced into a sample of cells taken 

28 from each of the six clones. After three days, representatives carrying all three constructs 

29 were analyzed by FACS to determine which of the clones were responsive to the 

30 dominant negative inhibitor. Clones that failed to respond to the presence of TcfA30 (i.e. 

31 those that remained brightly fluorescent) were discarded. One clone, S4535, exhibited 
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1 the sort of properties that are desirable in a reporter construct. In the absence of TcfA30, 

2 >95% of the S4535 cells fell into the bright peak (Figure 12). In contrast, when the 

3 S4535 clone was transfected with the dominant negative form of Tcf, a bimodal 

4 histogram containing near equal numbers of bright and dim cells was observed, 

5 suggesting that S4535 is responsive to the presence of agents that disrupt the p-catenin- 

6 Tcf-APC pathway. Subsequently, the original parental clone carrying the (3-catenin S45 Y 

7 and TBE2-EGFP constructs, was expanded for future perturbagen screens. 

8 Example 6: Preparation and Transfer of a cDNA Library 

9 Using techniques that are familiar to individuals in the art, randomly primed 

10 cDNA libraries were used as a source of sequences encoding putative (3-catenin/Tcf 

1 1 pathway blocking agents. As one non-limiting example of how to construct such a 

□ 12 library, polyA mRNA derived from placental tissue was PCR amplified using a random 

13 9-mer linked to a unique Sfil sequence ("SfiA"), followed by an additional set of 

a 14 nucleotides that is used later for library amplification (OVT 906: 5' ACTCTGGACTAG 

3 15 GCAGGTTCAGTGGCCATTATGGCC(N) 9 ). The product of this reaction was size 

16 selected (>400 base pairs) and subjected to RNAse A/H treatment to remove the original 

17 RNA template. The remaining single stranded DNA was then subjected to a second 
I*. 18 round of PCR using a random hexamer nucleotide sequence linked to a second unique 

19 Sfil sequence ("SfiB") which was again followed by an additional set of nucleotides for 

O 20 future library amplification: (OVT 908: 5' AAGCAGTGGTGTCAACGCAGTGAGGCC 

21 GAGGCGGCC (N) 6 ). The final product of this reaction, a double stranded cDNA, was 

22 blunted/filled with Klenow Fragment (New England BioLabs), size selected, PCR 

23 amplified (OVT 909: 5' ACTCTGGACTAGGCAGGTTCAGT and OVT 910: 5' 

24 AAGCAGTGGTGTCAACGCAGTGA), digested with Sfil (New England BioLabs), and 

25 inserted into a retroviral vector (pVT 352. 1 , pBabe). As a result of these procedures, the 

26 sequences encoding the perturbagens were inserted at the 3' end of the non-fluorescent 

27 variant of EGFP (dEGFP). Expression of the dEGFP-perturbagen fusion gene (as well 

28 as the neomycin resistance gene present in the retroviral vector) was driven by the 5' 

29 LTR of pBabe. The library (-12 x 10 6 in size) was then packaged in 293gp cells 

30 (laboratory of I. Verma, Salk Institute) and retroviral supernatant was generated. 
31 
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1 Example 7 ; Screening for Perturbagens that Inhibit 

2 the Activation of the TBE-2-EGFP Reporter 

3 To identify perturbagens that disrupted the (3-catenin /Tcf pathway, the placental- 

4 derived perturbagen expression library was introduced into 20 x 10 6 S4535 cells using 

5 standard retroviral transduction techniques. Subsequently, the cells were cultured for 5 

6 days and sorted to identify individuals within the population that exhibited decreased 

7 levels of GFP expression. In the first round of sorting, 14 x 10 6 cells of the population 

8 were collected. Genomic DNA was prepared from these cells and the perturbagen 

9 encoding insert was PCR amplified for sublibrary preparation. Specifically, the DNA 

10 encoding the perturbagens was PCR amplified from genomic DNA using two 

1 1 oligonucleotides that contained homology with sequences flanking the cDNA insertion 

12 site (oVT 181: 5' GGATCACTCTCGG CATGGACGAG and oVT 178: 5' 

13 ATTTTATCGATGTTAGCTTGGCCATT). One microgram of genomic DNA was added 

14 to each PCR reaction (along with the appropriate reagents to give lx PCR cocktail, e.g. 

15 2.5mM MgCh, lOmM oligos and 0.5-lmM dNTP and (Taq) polymerase (Boehinger 

16 Mannheim). Total PCR reaction volumes varied between 20 and 50ul/reactions and PCR 

17 cycling conditions followed the protocol of: 94°C, 2 minutes; 94°C 15 seconds; 68°C, 2 

18 min and 30 sec (cycle 24 x); 68°C 3 minutes. The PCR product was then purified 

19 (Qiagen PCR purification kit), digested with Sfil (New England Biologicals) and 

20 directionally ligated (T4 ligase, Boehinger Mannheim) into the original vector 

21 (pVT352. 1). This material was then transformed into bacteria by electroporation 

22 (DH10B, Electromax, Gibco) and inoculated into 500 ml of LB-Amp media for selection 

23 and expansion of bacterial cells that contained a member of the sublibrary. The bacterial 

24 cells were harvested at log phase of cell growth, and prepared (Qiagen) for 

25 isolation/purification of large quantities of the cDNA sublibrary. Subsequently, this 

26 material was re-packaged in 293gp cells in preparation for subsequent rounds of cycling 

27 and enrichment in S4535. 

28 Following the second round of cycling, the population that fell within the dim 

29 gate was collected, replated, and expanded. These cells were then sorted (without 

30 sublibrary construction) to facilitate further enrichment of the cell population containing 
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1 perturbagen sequences. Rounds three and four included sublibrary preparation and 

2 proceeded in a similar fashion to round one, 

3 The fraction of HEK293 cells expressing lower levels of GFP changed 

4 dramatically over the course of the 4 rounds of cycling/enrichment. In the early rounds 

5 of screening, the number of dim cells observed in the S4535 library-containing 

6 populations mirrored the percentage observed in control studies (5-8%). In contrast, by 

7 the end of 4 rounds of cycling/enrichment procedures, the number of dim cells had grown 

8 to 34.4 % (2.7 x over the observed background levels). Nearly one thousand library 

9 clones were picked from the fourth round of cycling, and screened individually using a 

10 high throughput screening procedure, to identify i) perturbagens that were capable of 

11 suppressing the expression of GFP in S4535 cells, and ii) agents that exhibited cytostatic 

12 or cytotoxic properties in secondary cell lines. 



13 Example 8: High Throughput Screening 

14 1. Isolation of Individual Perturbagen Clones for High- 
i 5 Throughput Screens. 

16 Following multiple successive rounds of en-masse screening, the perturbagen 

17 encoding sequences were PCR amplified from the resultant sublibrary, recloned en masse 



18 into the appropriate retroviral vector (e.g. pVT 352.1 or pVT1515) transformed into 

19 bacteria (DH10B Electromax, Gibco) by electroporation, and plated on selective agar 

20 plates (LB, Amp + ) to identify individual clones that contained potential perturbagen 

21 inserts. To prepare purified plasmids of individual perturbagen clones, bacterial colonies 

22 were picked (either by hand or automated techniques, AutogenSys, Autogen) and grown 

23 overnight in a 96 well plate format (LB, +Amp media, 37°C). Samples from each well 



24 were then removed and frozen down as glycerol stocks which were later thawed, grown 

25 in liquid culture, and processed for plasmid DNA preparation.(Multiscreen Filtration 

26 Plates, Millipore). 

27 2. ht preparation of viral supernatants and transduction 

28 Procedures 

29 To obtain viral supernatants for HT screening procedures, 2 x 10 5 early passage 

30 293gp cells (in 180 ul of media) were plated into each well of a 96 well microtiter dish 
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1 using either automated (Sorval "Cytomat" and a Beckman "Multimek" instrumentation) 

2 or non-automated techniques. The cells were incubated overnight to allow attachment to 

3 the solid support and then transfected with the individual retroviral miniprep DNA's. 

4 Several viable methods can be used to transfect cells in this format (e.g CaC^ 

5 .Lipofectamine, Transit™). In the CaCl 2 method, 133ng of library plasmid DNA was 

6 mixed with 534ng of envelope plasmid in a total volume of 5 pi. CaC^ was then added 

7 (5pl) to a final concentration of 250 mM, followed by addition of an equal volume (lOpl) 

8 of 2x BBS (50mM BES(N,N-bis(2-hydroxyethyl)-2-aminothane-sulfonic acid), 280 mM 

9 NaCl, 1.5 mM Na 2 HP0 4 , pH6.95). The solution was mixed every 5 minutes for 20 

10 minutes before adding 20pl dropwise to the wells containing the gp293 cells. The cells 

1 1 were allowed to incubate 16 hours at 37°C, and the media was replaced with lOOpl of 

12 fresh media. At 72 hours post transfection, the supernatant was removed to an empty 96 

13 well plate, exposed to multiple freeze/thaw cycles (or filtration) to remove potential 

14 contaminant 293gp cells, and then frozen at -80°C for storage. 

15 Transductions were performed by plating each cell line (e.g. S4535, HT29) in a 

16 microti ter plate in a total volume of 100 ul media. The following day, each retroviral 

17 supernatant was thawed, filtered through a 0.45 um Multiscreen-HV sterile filter plate 

18 (Millipore Corp., Bedford, MA) and added to the cells along with polybrene (4ug/ml). In 

19 most instances, the viral supernatant represented 50% of the volume of the final mixture. 

20 3. HT TransFACS Screen for Modulators of TBE2-GFP Expression in HEK293 

21 Cells, 

22 To test the ability of individual perturbagens to down-regulate the TBE2-EGFP 

23 reporter construct, three hundred S4535 cells were plated in each well of a 96-well format 

24 and allowed to attach overnight. Subsequently, a single viral supernatant (85ul) and 

25 polybrene (final concentration of 4ug/ml) was added to each well and allowed to incubate 

26 for 16 hours. Following viral infection, the media was replaced with 200ul of fresh 

27 culture media. All of the above mentioned procedures were performed by hand or made 

28 use of the Sorval "Cytomat" and a Beckman "Multimek" instrumentation. 

29 Perturbagen expressing S4535 cells were cultured for 6 days at 37°C (5% CO2) before 

30 being analyzed by FACS. To prepare the cells for analysis, each well was washed lx 

31 with PBS and then treated with trypsin (50ul of a 0.05% solution. + 53mM EDTA, 10 



84 



1 minutes, 37°C, Life Technologies; Gaithersburg, MD) to release the cells from the 

2 surface of the well. Subsequently, 150 ul of DMEM + 10% FCS was added to each well 

3 to neutralize the trypsin and samples were then analyzed on the FL1 channel of a Coulter 

4 Epics XL-MCL (Beckman Coulter; Fullerton, CA) using EXPO software and an 

5 automated 32-position sample carousel. 

6 Using the procedures described above, nine hundred and fifty-seven clones 

7 (obtained after four rounds of TransFACS screening, en mass) were tested for their 

8 ability to alter GFP expression levels in the S4535 clonal line. Of the roughly 1,000 

9 clones analyzed, roughly 8% (76 clones) were judged to be positive (i.e. the perturbagen 

10 inhibited expression of GFP) based on positive and negative controls previously 

1 1 described (e.g. pVT1515 and pVT312-dGFP-ATcf~30). When the DNA sequences of 

12 these clones were matched with the FACS data, all the positive clones were found to be 

13 fragments of various forms of cadherin including cad5, cadi 1, and cadherin 6 (a member 

14 of the type II cadherins, see Shimoyama, Y. et al. (1995) "Isolation and sequence analysis 

15 of human cadherin-6 complementary DNA for the full coding sequence and its 

16 expression in human carcinoma cells." Cancer Res. 15;55(10):2206-11). The sequences 

17 of the cadherin clones selected in the P-catenin assay were similar (see Figures 13-15). 

1 8 All comprised C-terminal fragments representing the cytoplasmic domains of the native 

19 molecules fused to GFP. The smallest fragment, GFP-cadll, was 93 amino acids (not 

20 including GFP). Each cadherin sequence included the region known to interact with P~ 

21 catenin (Huber and Weis, 2001) and exhibited a penetrance that was similar to TCF4A30 

22 (see Figure 16). 

23 4. Testing Cad5 C-terminal Fragment for Differential Cytostatic and Cytotoxic 

24 Activity using a HT assay 

25 The cytotoxic and cytostatic activity of Cad perturbagens was assayed using a 

26 variation of the high-throughput techniques described in US No. 60/305,712, 

27 VEN012/00, "Automated Assay Methodology," the contents of which are incorporated in 

28 this document, in full. The procedures involved in this assay include i) transducing 

29 perturbagen encoding retroviral constructs into a suitable cell line(s) in a 96-well format, 

30 ii) culturing cells for a sufficient length of time necessary to observe the phenotype of 
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1 interest, and iii) assaying said cells for phenotype of interest (i.e. cell death and total cell 

2 number). 

3 In one non-limiting technique, retroviral transductions were performed by plating 

4 roughly 10 3 cells of each line in a microtiter well of a 96 well plate (total volume = 100 \xl 

5 media) and allowing the cells to attach over the course of several hours (or overnight). 

6 Subsequently, the retroviral supernatants were added to the cells along with polybrene 

7 (4|al/ml). In most instances, the viral supernatant represented 50% of the volume of the 

8 final mixture. In another transduction format, 84[il of filtered viral supernatant was 

9 combined with lOOjul of cells seeded the previous day in black plastic walled, clear 

10 bottom 96- well plate(s). The seeding density of the plates varied, depending on the cell 

1 1 type, and was determined empirically as the density that would produce an approximately 

12 80% confluent culture by six days post-transduction. The number of seeded cells were as 

13 follows: HT29 = 500 cells/well; HEK293 = 300 cells/well; primary HMEC = 2,000 

14 cells/well; primary HUVEC = 200-1000 cells/well, depending on the passage. Polybrene 

15 was also added to the transduction to a final concentration of 4jig/ml. Approximately 16 

16 hours post-transduction, the media was changed, and incubation was allowed to continue 

17 at37°C. 

18 To determine the cytotoxic/cytostatic effects of a given perturbagen, cells 

19 transduced with the agent of choice were analyzed to determine the number of dead and 

20 live cells remaining in the well after a given period of incubation. There are several 

21 methods that can be used to quantitate the number of dead and/or dying cells. As one 

22 non-limiting example, the ensuing procedures are followed: five days post-transduction, 

23 Sytox Orange (Molecular Probes, Eugene, OR) is added to each well of the assay plate to 

24 a final concentration of l|nM. The plates are then allowed to incubate for 30 minutes at 

25 37°C, and then analyzed on a CCD imaging system (Sytox Orange, Ex:535+/-15nm, Em 

26 585 +/-20nm) to determine the number of cells having a compromised cell membrane 

27 (i.e. dead and/or dying cells). In this instance, the imaging system is composed of a 

28 PixelVision Spectra Video™ Series imaging camera (1 100 x 330 back-illuminated array, 

29 Pixel Vision, Tigurd, OR), Pixel Vision PixelView™ 3.03 software, two 50mm/f2 

30 Olympus macro focusing lens mounted front to front, four 20750 Fostec xenon light 

31 sources (Schott-Fostec, Auburn, NY), four 8589 Fostec light lines, a 4457 Daedal stage, 
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1 and supporting mechanical fixtures. Mechanical fixtures were constructed to position 

2 the PixelVision camera below a microliter dish. Additionally, the fixtures mounted the 

3 four Fostec light lines and allowed the excitation light to be focused on the viewed area 

4 of the microliter dish. A 5 10 nm filter was placed between the two lenses. The front-to- 

5 front lens configuration provides 1 : 1 magnification and close placement of the target 

6 object to the imaging system. 

7 After the dead-cell count is executed, a total cell number is determined by adding 

8 a detergent, e.g. saponin or tween-20, to each well to permeablize the remaining cells 

9 (0. 1 %, 30 minutes). Subsequently, an additional readout is performed to determine the 

10 total well fluorescence, (an indirect measure of the total number of cells). The number of 

1 1 dead and live cells in each well is then compared with the appropriate controls to 

12 determine if any cytotoxic/cytostatic properties are associated with the agent under study. 

13 To investigate the effects of the Cad5 C-terminal fragment on cell viability and 

14 growth, a pVT15 15-dGFP-Cad5 plasmid was introduced into four separate cell types 

15 (HT29, HEK293, HuVEC, and HMEC) using the techniques described above. The HT29 

16 colon carcinoma cell line contains an inactivating mutation in APC (and hence a 

17 constitutively activated p-catenin pathway). In contrast, the transformed kidney cell line, 

18 HEK293, and two primary cultures (HMECs and HUVECs) contain a wild type 

19 (regulated) p-catenin pathway. In concurrent, parallel experiments, the parent plasmid, 

20 pVT1515, and a positive control plasmid (pTcfDN) were also transduced into the test 

21 cells. Measurements of dead cell numbers and total cell fluorescence were then collected 

22 on all samples using the Sytox staining procedure. In a single experiment, a microtiter 

23 plate with 8 replicates per clone was used, and multiple plates were analyzed. In all, 

24 between 24 and 97 replicates of each construct were examined. 

25 Looking at cell death as the first key parameter, analysis show that like TcfDN, the C- 

26 terminal fragment of Cad5 exhibits a strong cytotoxic effect in HT29 cells (Figure 17). 

27 These results are in contrast with the observations made in cell lines carrying an intact 

28 beta-catenin pathway. In HEK 293 cells, HMECs, and HUVECs, the cytotoxic effects of 

29 the Cad5 perturbagen are considerably smaller and approach the numbers induced by 

30 introduction of the parental plasmid, pVT1515. Analysis of total cell numbers in each of 

31 the cell lines further supports the notion that the C-terminal fragment of Cad5 has a more 
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1 deleterious effect in cells containing a disrupted P-catenin/APC pathway (Figure 18). The 

2 over-expression of the Cad5 perturbagen caused a significant reduction in the number of 

3 HT29 cells, yet did not substantially alter the growth of the other cells tested. This is in 

4 contrast to the dominant negative Tcf construct that did not make an observable alteration 

5 in the numbers in HT29 cells, but decreased cell numbers in all non-colon lines. These 

6 data demonstrate that the Cad5 C-terminal fragment embodies both cytotoxic and 

7 cytostatic properties. Furthermore, the effects of Cad 5 are accentuated in colon cancer 

8 cells containing defects in the [3-catenin/APC pathway, while exhibiting less noticeable 

9 consequences in other tissues having an intact pathway. 

10 5. Targets of the Cad5 Perturbagen 

1 1 Microarray expression studies were used to identify additional targets of the Cad5 

12 perturbagen. To determine which genes were induced (or repressed) in cells containing 

13 the Cad5 C-terminal fragment, the Cad5 agent was introduced into S4535 cells, and 

14 polyA RNA was isolated and readied as probes for microarrays. To accomplish this, 

15 S4535 cells were seeded into 96 well tissue culture plates (Becton-Dickinson; Bedford, 

16 MA) at 300 cells/well (in a total of lOOjil) one day prior to transduction. On the day of 

17 transduction, Cad5, TcfDN, or control (pVT1515) viral supernatants were thawed, and 85 

18 jal of supernatant was added to the wells containing S4535 cells. Polybrene was added to 

19 4(Ltg/ml to enhance infection. Sixteen hours post transduction, the media was changed, 

20 and the cells were allowed to continue incubation until 6 days post transduction. 

21 At six days post transduction, the media was removed, the cells were washed with PBS, 

22 and 50]il of a solution containing 0.05% trypsin plus 0.53mM EDTA (Life Technologies; 

23 Gaithersburg, MD) was added to each well. After ten minutes, the trypsin was neutralized 

24 by the addition of 150(il of DMEM with 10% fetal calf serum and a portion of the 

25 suspension was analyzed by fluorescent activated cell sorting (FACS, Coulter Epics XL- 

26 MCL; Beckman Coulter; Fullerton, CA) using an automated 32-position sample carousel. 

27 Results of these procedures demonstrated that the large majority (>95%, see Figure 19) of 

28 S4535 cells carrying either the Cad5 (or TcfDN) fragments had switched off reporter 

29 expression, thus confirming expression of the inhibitor. Total RNA and polyA RNA was 

30 then prepared from roughly 1x10 cells taken from each group (Rneasy, Oligotex, 

31 Qiagen; Valencia, CA). The intactness of each sample was then inspected by agarose gel 
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1 electrophoresis and quantitated by spectrophotometer prior to performing differential 

2 display analysis (Incyte Genomics, St.Louis, MO). 

3 To analyze the data from the roughly 9,000 human genes and ESTs that were 

4 present on each microarray slide, the statistical analysis proceeded using the method of 

5 Kamb, A., "A simple method for statistical analysis of intensity differences in 

6 microarray-derived gene expression data" (submitted for publication). Briefly, the 

7 internal controls accompanying the Incyte data files were removed to allow data 

8 manipulation. Intensity differences (S1-S2), and average signals ((Sl+S2)/2) ) for the 

9 pVT15 1 5 control RNA (in separate hybridizations) were then calculated for each 

10 analogous gene pair on the slide. These columns were then sorted (in descending order ) 

1 1 based on the average signal and averaging window sizes were tested. After settling on 

12 100 data points as the window, an averaged (SI +S2)/2 incremented by one point each 

13 time was calculated along with <rand o 2 values for the corresponding sets of 100 points 

14 in the intensity column. Polynomials and lines were fit to plots of avg. (Sl+S2)/2 

15 vs. a J- 12. These fits were used to compute a s 2 values at each signal intensity. Sequences 

16 with very low average signals ((SI +S2)/2 < 400) corresponding to 20% of the dataset 

17 were removed. To calculate a z statistic, each AS value for the inhibitor RNA - control 

18 (pVT1515) was divided by its corresponding <tas (the distribution mean, was 

19 approximately zero, and was not subtracted from AS to find z). 

20 Analysis of the microarray experiments comparing Cad5CD and TcfDN revealed 

21 highly correlated datasets, suggesting that both agents affected S4535 cells in a similar 

22 fashion. Interestingly, though there were many statistically significant differences in 

23 expression (when compared with control studies) the largest single change was only 2.3- 

24 fold (the Dickkopf-1 gene) suggesting the phenotypic similarities between Cad5CD (and 

25 TcfDN) were related to a large number of relatively small alterations in gene expression. 

26 It should be noted, however, that dynamic range compression of microarray data, cell-to- 

27 cell variation in inhibitor expression level, and population averaging of the cellular 

28 response at the RNA level could cause underestimation of the ratios in cells and/or their 

29 biological significance. 

30 A group of genes whose expression levels differed significantly from controls (\z\ 

31 > 3; p < 0.01) were identified in Cad5-perturbagen containing cells. Interestingly, many 
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1 of these Cad5CD targets exhibited similar patterns of gene expression in both TcfDN- 

2 containing and Cad5CD-containing cells, suggesting a common effect of Cad5CD and 

3 TcfDN on gene expression in HEK293 cells (see Figure 20). Still other genes were found 

4 to be modulated only in cells carrying the Cad5CD perturbagen (see Figure 21). Amongst 

5 the genes identified were several growth-related genes including an NDP kinase, and 

6 tyrosine phosphatase epsilon, both of which have been implicated in tumor formation 

7 (Elson, A. (1999) "Protein tyrosine phosphatase epsilon increases the risk of mammary 

8 hyperplasia and mammary tumors in transgenic mice." Oncogene, 18: 7535-42; and 

9 Nakayama, T., et al. (1992) "Expression in human hepatocellular carcinoma of 

10 nucleoside diphosphate kinase, a homologue of the nm23 gene product." /. Natl. Cancer 

1 1 Inst., 84: 1349-54). In addition two genes related to interferon response were found to be 

12 modulated. A transforming sequence from adenovirus (Elb) was also present among the 

13 repressed genes in both datasets, a potentially significant observation given the role of 

14 adenovirus in creating the HEK293 cell line (Nakayama, T. et al. (1992). "Expression in 

1 5 human hepatocellular carcinoma of nucleoside diphosphate kinase, a homologue of the 

16 nm23 gene product. " /. Natl. Cancer Inst., 84: 1349-54). 

17 Genes with known roles in the P-catenin pathway were also observed amongst the 

18 target group. Importantly, cyclin Dl — known to be repressed by TcfDN — was in the 

19 category of down-regulated genes in Cad5 containing lines. In addition, axin-2 (also 

20 known as conductin), which acts upstream in the p-catenin pathway to inhibit p-catenin 

21 function (Nakamura, T., et al. (1998) "Axin, an inhibitor of the Wnt signalling pathway, 

22 interacts with beta-catenin, GSK-3beta and APC and reduces the beta-catenin level. " 

23 Genes Cells, 3: 395-403; Kishida, S. et al. (1998) "Axin, a negative regulator of the wnt 

24 signaling pathway, directly interacts with adenomatous polyposis coli and regulates the 

25 stabilization of beta-catenin" J. Biol. Chem., 273: 10823-6.), was down-regulated, 

26 suggesting the existence of a compensatory feedback mechanism in the p-catenin 

27 pathway as a consequence of Cad5CD (and to a lesser extent TcfDN) inhibition. 

28 Cadherin-2, another negative regulator, was also repressed by Cad5CD. 
29 
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1 As is apparent to one of skill in the art, various modifications of the above 

2 embodiments can be made without departing from the spirit and scope of this invention. 

3 These modifications and variations are within the scope of this invention. 
4 



91 



